Pneumococcal serotypes

ABSTRACT

Disclosed is a new and emerging serotype of  Streptococcus pneumoniae  designated serotype 6C, and assays and monoclonal antibodies useful in identifying same. Also disclosed is a novel pneumococcal polysaccharide with the repeating unit {→2) glucose 1 (1→3) glucose 2 (1→3) rhamnose (1→3) ribitol (5→phosphate}. This new serotype may be included in pneumococcal vaccines.

RELATED APPLICATIONS

This application is related to and claims the benefit of U.S.Provisional Patent Applications No. 60/754,354, filed 28 Dec. 2005 andNo. 60/796,139, filed 28 Apr. 2005.

STATEMENT OF FEDERALLY SPONSORED RESEARCH

This invention was funded, in part, by the United States FederalGovernment under NIH Contract No. AI 30021. Accordingly, the FederalGovernment may have certain rights in this invention.

FIELD OF THE INVENTION

This invention relates to bacteriology, immunology, and epidemiology.More specifically, this invention relates to new and emerging serotypesof Streptococcus pneumoniae and assays and monoclonal antibodies usefulin identifying these serotypes.

BACKGROUND

Streptococcus pneumoniae is a well known human pathogen and a majoretiologic agent for pneumonia, meningitis, otitis media as well assepsis, among primarily young children and older adults. S. pneumoniaehas been divided into ninety serotypes based on its expression ofserologically distinct carbohydrate capsules. Antibodies to a capsularpolysaccharide (PS) may provide protection against pneumococciexpressing the same capsular serotype. Currently available pneumococcalvaccines contain a mixture of capsular PS of multiple serotypes. Forexample, one pneumococcal vaccine (called PS vaccine) contains capsularPS from twenty-three commonly found serotypes. The most recentlydeveloped type of vaccine (called conjugate vaccine) contains capsularPS from seven to thirteen serotypes that are conjugated to a proteinmolecule. A seven-valent conjugate vaccine was introduced in 2000 forclinical use in the USA and has reduced the incidence of invasivepneumococcal diseases in children and in adults.

The distribution of pneumococcal serotypes is useful in estimatingvaccine efficacy. Ideally, an effective pneumococcal vaccine wouldreduce the prevalence of pneumococci expressing the serotypes includedin the vaccine and leave the prevalence of the pneumococci expressingnon-vaccine serotypes the same. In reality, the prevalence of thepneumococci expressing non-vaccine types increases to replace thoseexpressing the vaccine serotypes. Further, the prevalence of specificserotypes may change over time for unknown reasons. Consequently,accurate and efficient serotyping of pneumococcal isolates is importantfor monitoring the efficacy of pneumococcal vaccines. Indeed,identifying emerging pneumococcal serotypes remains a crucial goal inpublic health.

To that end, although current polyclonal antibodies are useful inidentifying and monitoring pneumococcal serotypes, there remains a needfor improved identification assays that might take advantage ofmonoclonal antibody technology and the need to identify new serotypes.

SUMMARY OF THE INVENTION

An embodiment presented herein provides for the identification of a newand emerging pneumococcal serotype and means for identifying same. Morespecifically, the present invention provides for a novel pneumococcalserotype closely related to serotype 6A, identified herein as serotype6Aβ or 6C (which are synonymous).

An additional feature provides for an isolated culture of a bacteriumdesignated Streptococcus pneumoniae 6C.

Another embodiment provides for a novel polysaccharide with therepeating unit {→2) glucose 1 (1→3) glucose 2 (1→3) rhamnose (1→3)ribitol (5→phosphate}.

Another feature provides for monoclonal antibodies (mAbs) useful inidentifying emerging pneumococcal serotypes. Thus the present inventionprovides for monoclonal antibodies useful for distinguishing serotype6C, identified here as mAb Hyp6AM3, mAb Hyp6AM6, and mAb Hyp6AG1.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 depicts the results of an inhibition ELISA. Antibody bound(Y-axis) against dilution of pneumococcal lysates (X-axis). Lysatesinclude two 6Aβ isolates (solid symbols with continuous lines), three6Aα isolates (open symbols with dotted lines), and two 6B isolates(dashed connecting lines). Antibodies used for the assay were Hyp6AG1(Panel A), Hyp6AM3 (Panel B), rabbit Pool serum Q (Panel C) and rabbit“factor 6b” serum (Panel D).

FIG. 2 depicts opsonization assay data with various pneumococci. Thenumber of surviving bacteria measured as a percentage of the bacteriaadded to the reaction well at the beginning of the opsonization assayreaction (Y axis) was plotted against the dilution of a human serum(X-axis) used in an opsonophagocytosis killing assay. The assay usedvarious pneumococci including a 6β isolate (open circle), two 6Aαisolates (open square, open triangle), and seven 6Aβ isolates (datapoints connected with dashed lines). The seven 6Aβ isolates includethose from Brazil, Korea, and USA.

FIG. 3 illustrates an opsonization titer comparison. Opsonization titeragainst a 6A subtype (Y-axis) vs. opsonization titer against 6B serotype(X-axis). Circles and triangles indicate opsonization titers against 6Aαor 6Aβ respectively. The study used sera from twenty adults who were notvaccinated (left panel) or twenty adults who were vaccinated (rightpanel) with a conjugate vaccine (solid symbol) or a 23-valentpolysaccharide vaccine (open symbol). There were ten persons in eachvaccine group. The detection limit of the assay is 4 and a sample withundetectable opsonization titer was assigned to have a titer of 2. Whenthere were multiple data points at one spot, data points wereartificially spread out to show the number of data points.

FIG. 4 presents DNA sequences of a part of the wciP gene from variouspneumococcal isolates.

FIG. 5 is a photograph of an agarose gel showing PCR products obtainedwith nine 6Aα isolates (lanes 1-9) and six 6Aβ isolates (lanes 10-15).Two lanes marked M were loaded with a DNA size marker. The two lanesshow that molecules in the right side of the gel moved faster than thosein the left. The two marker bands above and below the pneumococcal PCRproducts are 2.036 Kb and 1.636 Kb long respectively. The 6Aα and 6Aβyielded PCR products that were about 2 Kb and 1.8 Kb long respectively.

FIG. 6 presents a diagram of wchA, wciN, wciO region of the pneumococcalcapsule gene locus of isolates AAU9 (middle bar) and ST745 (bottom barin two pieces). For comparison, the top bar shows a diagram of wchA,wciN, wciO region of pneumococcal capsule gene locus based on CR931638(a GenBank entry). Genes wchA, wciN, and wciO are labeled above the topbar along with their lengths. Nucleotide sequence positions wereindicated below the top bar and the sequence position 1 shown herecorresponds to the sequence position of 4902 of CR931638. The ST745strain sequence is 193 base pairs short and the shortage was shown as agap between position 2398 and 2591.

FIG. 7 depicts the carbohydrate composition (Panel A) of capsular PSfrom 6Aα (top) and 6C (6Aβ, bottom) before and after periodatetreatment. The monosaccharides are identified in the top chromatogram.In this GLC analysis, a monosaccharide can produce multiple peaks withcharacteristic retention times and relative proportions. For instance,galactose should have three peaks: first peak (short), second peak(tallest), and third peak (intermediate). Panel B shows normalized peakareas of each monosaccharide for 6Aα (grey bar) and 6C (6Aβ, black bar).The peak areas of all monosaccharides from each PS are normalized to thepeak area of the associated rhamnose. The 6C (6Aβ) shows no galactosepeaks but has twice as much glucose as 6Aα does.

FIG. 8 depicts the mass spectrum of the repeating units of 6Aα (Panel A)and 6Aβ (Panel B) and their daughter ions (Panels C and D respectively).Mass to charge ratio (m/z) was rounded off to two decimal points.

FIG. 9 shows the mass spectrum of the repeating unit of 6Aβ PS afteroxidation and reduction (Panel A) and their daughter ions (Panels B andC). The sample used for Panel B was reduced with NaBH₄ and that forPanel C was reduced with NaBD₄. Mass to charge ratio (m/z) was roundedoff to two decimal points. R1 and R2 (in Panels B and C) indicate thatthe peaks correspond to ions derived by reverse fragmentations. Numbersfollowing the delta symbol indicate the m/z unit differences between thepeaks and associated with the names of the fragments. All the peaks inPanel C correspond to the peaks in Panel B except for a peak at 136.98,which was not reproduced in Panel B and may be a contaminant.

FIG. 10 presents the proposed chemical structures of 6C capsularpolysaccharide and the structure of its cleavage products. Proposedstructure of the 6C repeating unit is shown in Panel C. Panels A and Bshows possible molecular ions if the phosphate group is attached toribitol and if the phosphodiester is linked to the second carbon ofglucose 1. Panels D, E, and F indicate potential cleavage patterns ofthe repeating unit if the phosphodiester is linked to the second (PanelD), the fourth (Panel E), or the sixth carbon (Panel F) of glucose 1.Hydrated forms are shown and the residues involved in hydration areshown in parentheses. Periodate sensitive sites are shown in bold andcleavage products are shown in Panels A and F. Potential molecular ionsare shown with dotted lines with arrows along with their atomic massunits. Gx and Gy are potential glucose 1 fragments and Rx is theremaining ribose fragment after oxidation and reduction reactions. Theiratomic mass units are shown in parenthesis.

FIG. 11. This figure depicts the wciN region exchange experimentdiagram: In step A, wchA/wciNα/wciO-P region of TIGR6A4 was replacedwith Cassette 1. Cassette 1 has three parts (central core and twoflanking regions) and each part is about 1 kb long. The central core hasantibiotic susceptibility genes, kanR and rpsL⁺. The two flankingregions were made with wchA and wciO-P regions from AAU33 strain. Instep B, Cassette 1 in TIGR6AX was replaced with Cassette 2. Cassette 2has wciNβ gene, wchA and wciO-P regions from a 6C strain (CHPA388).TIGR6C4 shows the final product that is obtained after Cassette 2 isinserted. XbaI and BamHI sites in the PCR primers, which were introducedto simplify genetic manipulations, were shown.

FIG. 12 shows the electrophoresis pattern of the PCR products of wciNregion of 6A and 6C isolates. Primers used for the PCR were 5106 and3101, which are located in wchA and wciO genes respectively. Lanesmarked M has DNA ladders. Standard markers with 2000 and 1650 bps wereindicated in the left. Lanes 1-13 contain PCR products of 6C isolates,which are CHPA37 (lane 1), CHPA388 (lane 2), BG2197 (lane 3), BZ17 (lane4), BZ39 (lane 5), BZ86 (lane 6), BZ650 (lane 7), KK177 (lane 8), CH66(lane 9), CH158 (lane 10), CH199 (lane 11), MX-67 (lane 12), and ACA-C21(lane 13). Lanes 14-18 contain PCR products of 6A isolates, which areCHPA67 (lane 14), CHPA78 (lane 15), BZ652 (lane 16), KK58 (lane 17) andAAU33 (lane 18).

FIG. 13 present the nucleotide sequence of wciNβ ORF along with thenucleotide sequences of the 3′ end of wchA and the 5′ end of wciO genes.The potential amino acid sequence of wciNβ ORF is shown below thenucleotide sequence. Also shown are putative termination sites of wchAand wciNβ as well as putative initiation sites of wciNβ and wciO genes.The wciO gene has two potential initiation sites.

FIG. 14 shows the DNA sequences of wciNα and wciNβ regions of a 6Astrain (GenBank CR931638) and a 6B strain (CHPA388). The sequence of thenon-homologous mid-region of wciN (about 900-1110 bases) is not shown.Sites of PCR primers (5106, 3101, 5114, and 3113) are shown. Also shownare potential termination sites of wchA and wciNβ; and potentialinitiation sites of wciNβ and wciO.

FIG. 15 presents the genetic map of the capsule gene loci surroundingthe wciN gene of 6A and 6C isolates. The map shows wchA (hatched), wciN(horizontal bars or black), wciO (checkered), and wciP (wavy) genes. The6A locus has two unexpressed DNA fragments (indicated with arrows) inthe upstream of (95 bases long) or downstream (312 bases long) to thewciNα gene. An alternative initiation site for wciO gene is 32 basesupstream to the initiation site shown (position 2721 for 6A). For 6Cisolates, old DNA (1222 bases, region with horizontal bars) in wciNαregion is replaced with a new DNA (1029 bases, black region). Thereplacement creates a new ORF (named wciNβ) that has 1125 bases.

FIG. 16 depicts the capsule gene locus of 6A (GenBank CR931638) and 6Cstrain (CHPA388). All ORFs involved in the capsule synthesis are shownas horizontal arrows and their direction indicates the transcriptionalorientation. For both 6A and 6C loci, the putative transcriptioninitiation sites (bent arrow) and putative termination sites (verticalline with a solid circle) are identified. “Transposase” sequences (blackboxes, labeled “tnp”) are found in either ends of the capsule genelocus.

FIG. 17 presents the DNA sequence of the 6C serotype (isolate CHPA388)capsule gene locus.

DETAILED DESCRIPTION OF THE INVENTION

It should be understood that this invention is not limited to theparticular methodology, protocols, and reagents, etc., described hereinand as such may vary. The terminology used herein is for the purpose ofdescribing particular embodiments only, and is not intended to limit thescope of the present invention, which is defined solely by the claims.

As used herein and in the claims, the singular forms “a,” “an,” and“the” include the plural reference unless the context clearly indicatesotherwise. Thus, for example, the reference to an antibody is areference to one or more such antibodies, including equivalents thereofknown to those skilled in the art. Other than in the operating examples,or where otherwise indicated, all numbers expressing quantities ofingredients or reaction conditions used herein should be understood asmodified in all instances by the term “about.” The term “about” whenused in connection with percentages may mean±1%.

All patents and other publications identified are expressly incorporatedherein by reference for the purpose of describing and disclosing, forexample, the methodologies described in such publications that might beused in connection with the present invention. These publications areprovided solely for their disclosure prior to the filing date of thepresent application. Nothing in this regard should be construed as anadmission that the inventors are not entitled to antedate suchdisclosure by virtue of prior invention or for any other reason. Allstatements as to the date or representation as to the contents of thesedocuments is based on the information available to the applicants anddoes not constitute any admission as to the correctness of the dates orcontents of these documents.

Unless defined otherwise, all technical and scientific terms used hereinhave the same meaning as those commonly understood to one of ordinaryskill in the art to which this invention pertains. Although any knownmethods, devices, and materials may be used in the practice or testingof the invention, the methods, devices, and materials in this regard aredescribed here.

Streptococcus pneumoniae are Gram-positive, lancet-shaped cocci(elongated cocci with a slightly pointed outer curvature). Usually theyare seen as pairs of cocci (diplococci), but they may also occur singlyand in short chains. When cultured on blood agar, they are alphahemolytic. Individual cells are between 0.5 and 1.25 micrometers indiameter. They do not form spores, and they are non-motile. Like otherstreptococci, they lack catalase and ferment glucose to lactic acid.Unlike other streptococci, they do not display an M protein, theyhydrolyze inulin, and their cell wall composition is characteristic bothin terms of their peptidoglycan and their teichoic acid.

S. pneumoniae is a well known human pathogen and a major etiologic agentfor pneumonia, meningitis, otitis media as well as sepsis, amongprimarily young children and older adults. Fedson & Musher in VACCINES2nd ED. (Plotkin & Mortimer eds., W.B. Saunders Co., Philadelphia, Pa.,1994). A capsule composed of polysaccharide completely envelops thepneumococcal cells. During invasion the capsule is an essentialdeterminant of virulence. The capsule interferes with phagocytosis bypreventing C3b opsonization of the bacterial cells. Anti-pneumococcalvaccines are based on formulations of various capsular (polysaccharide)antigens derived from the highly-prevalent strains.

S. pneumoniae has been divided into ninety serotypes based on itsexpression of serologically distinct carbohydrate capsules. Henrichsen,33 J. Clin. Microbial. 2759-62 (1995). Antibodies to a capsularpolysaccharide (PS) may provide protection against pneumococciexpressing the same capsular serotype. Currently available pneumococcalvaccines contain a mixture of capsular PS of multiple serotypes. Forexample, one pneumococcal vaccine (called PS vaccine) contains capsularPS from twenty-three commonly found serotypes. Robbins et al., 148 J.Infect. Dis. 1136-59 (1983). The most recently developed type of vaccine(called conjugate vaccine) contains capsular PS from seven to thirteenserotypes that are conjugated to a protein molecule. Wuorimaa & Kayhty,56 Scand. J. Immunol. 111-29 (2002). A seven-valent conjugate vaccinewas introduced in 2000 for clinical use in the United States, and hasreduced the incidence of invasive pneumococcal diseases in children.Whitney, 348 N. Engl. J. Med. 1737-46 (2003).

Accurate efficient serotyping pneumococcal isolates is important formeasuring the efficacy of pneumococcal vaccines. Following theintroduction of a new pneumococcal vaccine, the vaccine-inducedantibodies provide serotype-specific protection. Hence, pneumococciexpressing the serotypes included in the vaccine become less commonwhile the prevalence of the pneumococci expressing non-vaccine types maystay the same. In some cases, pneumococci expressing the non-vaccinetypes replace those expressing the vaccine serotypes and the prevalenceof non-vaccine types may become higher. Pelton, 19(1) Vaccine S96-S99(2000). Further, the prevalence of serotypes can change over time forunknown reasons. Finland & Barnes, 5 J. Clin. Microbiol. 154-66 (1977).Because these changes influence the clinical effectiveness of a vaccine,serotyping of a large number of pneumococcal isolates is an importantpart of monitoring pneumococcal vaccines.

Moreover, regarding S. pneumoniae serotype 6A, current vaccineformulations do not carry a 6A PS, but carry the 6B PS because theantibodies raised against 6B are thought to cross react against 6A. Thisphenomenon, however, is not one-hundred percent: Some vaccines thatinclude the 6B PS do not raise antibodies against 6A. Yu et al., 180(5)J. Infect. Dis. 1569-76 (1999). Indeed, it appears that non-vaccineserotypes such as 6A are still causing disease in vaccinated children.Clover & Klein, Strategies for Prevention and Treatment of PneumococcalDisease, 44th Ann. ICAAC Meeting (Washington, D.C., 2004). Hence, theemergence and importance of additional 6A serotypes may become even moreimportant.

Further, the 6A and 6B serotypes account for 4.7 percent and 7 percent,respectively, of invasive pneumococcal diseases. Robbins et al., 148 J.Infect. Dis. 1136-59 (1983). Because of its medical importance, themolecular nature of serotype 6A and its related serotype 6B has beenstudied extensively. Biochemical studies found serotypes 6A and 6B PS tocomprise linear polymers of a repeating unit containing fourmonosaccharides: rhamnose, ribitol, galactose, and glucose. Kamerling,in S. PNEUMONIAE: M OLECULAR BIOLOGY & MECHANISMS OF DISEASE 81-114(Tomasz, ed., Mary Ann Liebert, Inc, Larchmont, N.Y., 2000). The two PSmay be identical except for a difference in linking rhamnose to ribitol.More specifically, the 6A PS has 1→3 rhamnose to ribitol linkage and the6B PS has 1→4 rhamnose to ribitol linkage.

Genetic studies report that pneumococci expressing either serotype havealmost identical capsule gene locus (CGL) of about 17.5 Kb in size.Sequence information is available on-line at, for example, the SangerInstitute's Sequencing Genomics Projects site. A consistent differenceexists in the wciP gene that encodes for rhamonosyl transferase.Mavroidi et al., 186 J. Bacteriol. 8181-92 (2004). The serotype 6A wciPgene encodes serine at residue 195 but the serotype 6B gene encodesasparagine at that residue. Id. It is presumed that the rhamonsyltransferase with serine makes 1-3 linkages and that with asparaginemakes 1-4 linkages.

Although there are various other serotyping methods well known in theart, the classical method is called quellung (Neufeld) method; thecurrently used methods are largely manual, slow, and tedious to perform.An improved serotyping assay named “multibead assay” is based on amultiplexed immunoassay that can be semi-automatically performed with aflow cytometer. Park et al., 7 Clin. Diagn. Lab. Immunol. 486-89 (2000).The multibead assay specificity has been fully established usingpneumococcal strains representing all ninety known serotypes. Yu et al.,43(1) J. Clin. Microbiol. 156-62 (2005). This assay provides superiorspecificity because the assay uses many mAbs specific for pneumococcalcapsular PS. In addition, the multibead assay is largely automated andcan provide a high throughput. Consequently, the assay may be useful inmany epidemiologic studies.

The multibead assay is particularly advantageous because monoclonalantibodies are more specific than polyclonal reagents. Regarding 6Aserotypes, although most “6A” isolates (defined by quellung reaction andpolyclonal reagents) reacted with 6A-specific monoclonal antibodies(Hyp6AG1, Hyp6AM6, and Hyp6AM3), some “6A” isolates reacted with one mAb(Hyp6AG1) but not others (Hyp6AM6 or Hyp6AM3). Other tests describedherein confirmed that the 6A isolates that did not react with Hyp6AM6 orHyp6AM3 were a previously unidentified 6A subtype. In other words, themonoclonal antibodies recognized subtypes within the 6A serotype. SeeLin et al., 44(2) J. Clin. Microbiol. 383-88 (2006). The inventorspreviously labeled the isolates reacting with both mAbs as 6Aα and thosereacting with only Hyp6AG1 as 6Aβ, but subsequently and herein proposethat the 6Aα remain 6A, and the new serotype be identified as 6C ratherthan 6Aβ. In other words, although both 6Aβ and 6C are used herein todesignate a novel pneumococcal serotype, they are equivalents. As such,serotype 6C represents the ninety-first pneumococcal serotype. Indeed,“6A” refers to isolates typed as “6A” by quellung reaction and includesboth 6A(6Aα) and 6C(6Aβ).

A consideration in defining a new serotype is its bindingcharacteristics with human antibodies. Because human antisera generallyhave non-opsonic antibodies binding to pneumococcal antigens other thancapsular PS, opsonization assay is more specific than ELISA. Also,opsonization capacity is more directly related to immunoprotectionagainst pneumococcal infections. Hence, the various 6Aα, 6Aβ, and 6Bisolates were compared using an opsonization assay and a human serumwith a high level of anti-6B antibodies. Although the human serumopsonized 6B as well as 6Aα (FIG. 2), it did not opsonize sevendifferent 6Aβ isolates from Brazil, Korea, and the United States (FIG.2). Taken together, these data indicate that 6Aβ isolates displaydistinct but uniform serological characteristics.

Genetic studies also confirmed that the 6C isolates were, indeed,members of the 6A serotype (rather than the closely related 6B serotypeor some other unrelated serotype). In a study of ten isolates collectedfrom Brazil, Korea, and the United States, all ten isolates identifiedas 6Aβ had the serine at residue 195, consistent with the wciP gene inserotype 6A. DNA sequences of the wciP gene of several pneumococcalisolates are presented in FIG. 4.

The genetic sequences of transferase genes wciN and wciO were alsocompared. When wciN region was examined by PCR using primers 5016 and3101 (5106: 5′-TAC CAT GCA GGG TGG AAT GT (SEQ ID NO:1) and 3161: 5′-CCATCC TTC GAG TAT TGC (SEQ ID NO:2), all nine 6Aα isolates examinedyielded about 200 base pair (bp) longer product than did all six 6Cisolates examined (FIG. 5). The six isolates included 6C isolates fromKorea, USA, and Brazil. Thus, this PCR can be used as a genetic test for6A subtypes.

The nucleotide sequences of the PCR products from one 6Aα isolate (AAU9)and one 6C isolate (ST745) were then compared (FIG. 6). All the basesbetween positions 1203 to 2959 (1757 bases) in AAU9 PCR product weresequenced and the sequence was found to be homologous to CR931638, whichis the capsule locus sequence of a 6A isolate reported in the GenBankdatabase. In contrast, the ST745 sequence was found to be almostidentical to that of 6Aα up to position 1368, and then again startingfrom position 2591. The intervening 1029 by sequence (from 1369 to 2397)is quite different from that of 6Aα. The intervening sequence containsabout 98 bp that is similar to a transferase (EpsG) used forpolysaccharide synthesis by Streptococcus thermophilus.

Thus, the work presented herein supports the genetic basis for the newpneumococcal serotype. The capsule gene locus of 6C is very similar tothe 6A locus except for the wciN gene: 6A strains have the wciNα gene,but 6C strains have the wciNβ gene, which is dramatically different(with only about 50% homology) from the wciNα gene. Because the twogenes differ in sizes, 6A and 6C serotypes can be readily distinguishedby PCR. The wciNα gene encodes WCINα with 314 amino acids, while thewciNβ gene produces a 1125 base-long ORF and its product, WCINβ has 374amino acids. These two proteins have little homology at the amino acidlevel.

Sequences of the putative wciN gene products suggest their glycosyltransferase functions. WCINβ has similarity to the staphylococcal capHgene product and has a 160-amino acid-long transferase domain thatbelongs to glycosyl transferase group 1 family. In contrast, WCINαbelongs to glycosyl transferase family 8 (ex), which includes manygalactosyl transferases. Campbell et al., 326 Biochem. J. 929-39 (1997).These observations are consistent with the chemical structures of the 6Aand 6C capsular PSs and support the contention that wciN is responsiblefor the differences between the 6A and 6C serotypes. Indeed, a 6A straincan be converted to a 6C strain by replacing the wciNα gene with thewciNβ gene.

The galactose/glucose exchange observed for 6A and 6C is found for otherpneumococcal serotypes. The 9L serotype PS of pneumococcus has agalactose molecule, but 9N PS has a glucose molecule. The capsule geneloci of the 9L and 9N serotypes resemble each other but differ in onegene, wcjA, which encodes a galactosyl transferase for 9L and a glucosyltransferase for 9N. The wcjA genes of the 9L and 9N serotypes are verysimilar; it is likely that one arose from the other by mutation. Incontrast, the wciNα and wciNβ genes are very different, and the wciNβgene is not homologous to any other pneumococcal genes available indatabases. Perhaps, the wciNβ gene may have originated from an organismother than pneumococci. In support of this hypothesis, an examination ofthe wciNβ gene shows two flanking regions, which may have participatedin homologous recombination and which are known to be critical forhomologous recombination in pneumococci. Prudhomme et al., 99 P.N.A.S.USA 2100-05 (2002). Additionally, studies of antibiotic-resistance geneshave shown horizontal gene transfers between S. pneumoniae and anotherbacterial species. See, e.g., Feil et al., 151(6) Res. Microbiol. 465-69(2000); Muller-Graf et al., 145(11) Microbiol. 3283-93 (1999); Coffey etal., 5(9) Mol. 2255-60 (1991).

The source of the wciNβ gene is not yet known. A part of the wciNβ geneis similar (81% homology) to the EpsG gene, a gene involved in thesynthesis of exopolysaccharide by S. thermophilus. The homology is foundfor only a very short piece of DNA, however, thus, S. thermophilus maynot be the source for wciNβ. The protein sequence of WCINβ resembles thewaaG (rfaG) gene product of E. coli K-12 strain and some pneumococcalgenes may have come from Gram-negative organism. Thus, it is possiblethat the wciNβ gene could have come from a Gram-negative species aswell. Nevertheless, S. salivarius, S. mitis, and S. oralis are theleading candidates because they co-exist in the oral cavity withpneumococci and many antibiotic-resistance genes have been linked to S.oxalis.

When the wciNβ region was examined for multiple 6C isolates, theircross-over points and flanking region sequences were found to beidentical. Also, their capsule gene locus profiles are highly limited to9-10-1 in contrast to 6A isolates, which have many different capsulegene locus profiles. Mavroidi et al., 2004. In addition, the 9-10-1capsule gene profile is unusual among and largely segregated from thecapsule gene profiles of the 6A and 6B isolates. These findings clearlyindicate that the capture of the wciNβ gene must have taken place onceand that all the 6C isolates are found through out the world and causingmany types of diseases have the capsule gene locus from the singlebacterium that originally became 6C. Because 6C may provide a unique andclear example of foreign gene capture, it would be a good model forstudying bacterial genetic evolution. This may also constitute a stablechange, unlike antibiotic resistance genes.

The 6C serotype has only one or two capsule gene locus profile(s)whereas the 6A and 6B serotypes have diverse capsule gene locusprofiles. Mavroidi et al., 2004. Thus, the 6C capsule gene locus mayhave appeared much more recently compared with the 6A or 6B capsule geneloci. Although 6C may have appeared more than twenty-seven years ago,these findings suggest the 6C serotype capsule gene locus appeared“recently” in one place and spread quickly through out the world. When agene provides strong survival advantage, the gene can spread quicklythroughout the world. For example, an antibiotic-resistance gene mayspread worldwide within only years. Perhaps natural human antibodies areless effective against 6C than against 6A or 6B. Whether the 6C capsulegene locus provides more survival advantage than 6A or 6B should beinvestigated.

MLST studies show that 6C expresses multiple independent STs. Thus, the6C capsule gene locus must have been exchanged among differentpneumococcal isolates. Whether the 6C capsule gene locus may combinewith a ST that provides additional survival advantages might beinvestigated. The spread of 6C and the emergence of the 6C capsule locusamong international strains that have multiple resistance genes shouldbe monitored.

The novel pneumococcal isolate provided for herein has a chemicallydistinct PS structure. More specifically, monosaccharide analysisindicated that the galactose found in the 6Aα capsular PS is absent inthe 6Aβ PS, which contains glucose instead. The repeating units of the6Aβ PS apparently contain one ribitol, one rhamnose, and two glucosemoieties. Therefore, the two subtypes 6Aα and 6Aβ described hereinshould be recognized as different serotypes. The 6Aβ subtype may bedescribed as serotype 6C while leaving the 6Aα subtype assigned to theserotype 6A. Serotype 6C should be included as the third member ofserogroup 6 in view of its serological and structural relation toserotype 6A. Serotype 6C would thus represent the 91st pneumococcalserotype, with 90 pneumococcal serotypes having been previouslyrecognized. Henrichsen, 33 J. Clin. Microbiol. 2759-62 (1995).

Galactose and glucose molecules differ only in the orientation of thehydroxyl group attached to their fourth carbon, and the repeating unitsof 6A and 6C PS differ only in the orientation of one hydroxyl group.This small structural difference explains why 6C was not identified withpolyclonal antisera in the past. With the elucidation of the chemicalstructure, 6C can be biochemically distinguished from 6A by carbohydratecomposition analysis or by NMR. Pneumococcal capsular PS can beidentified by simple proton NMR of anomeric protons. Abeygunawardana etal., 279 Anal. Biochem. 226-40 (2000). Although 6A and 6C NMR patternsdo differ, the NMR pattern of the anomeric protons of 6C is very similarto that of 6A. Although chemical and genetic tests can be used,serological methods may be the most useful way to identify 6C usingeither our monoclonal antibodies or polyclonal antisera made specific byabsorption.

Serogroup 6 has been known to contain three epitopes: 6a, 6b, and 6c.Henrichsen, 1995. Epitope 6a is known to be present in both serotypes 6Aand 6B whereas epitopes 6b and 6c are found only in either serotype 6Aor 6B, respectively. Discovery of the 6C serotype indicates the presenceof additional epitopes within serogroup 6. The mAb Hyp6AM3, whichrecognizes 6A and 6B but not 6C, should recognize epitope 6b. BecausemAb Hyp6AG1 recognizes 6A and 6C, it may be defined as recognizing a newepitope “6d”. Another mAb binding to all three serotypes (6A, 6B, and6C) and the shared epitope may be defined as “6e”. Aconfirmation-dependent epitope for serotypes 6A and 6B has also beendescribed. Sun et al., 69 Infect. Immun. 336-44 (2001). The observationof so many epitopes for serogroup 6 is consistent with a previousobservation that even a simple linear homopolymer of sialic acid canhave at least three epitopes. Rubenstein & Stein, 141 J. Immunol.4357-62 (1988). Indeed, pneumococcal PS have many more epitopes thanpreviously defined (Henrichsen, 1995), and that the presence of manyepitopes increases chances of altering epitopes during the manufactureof pneumococcal conjugate vaccines.

The discovery of serotype 6C was quite unexpected because serogroup 6has been extensively studied following its discovery in 1929.Heidelberger & Rebers, 1960. One should therefore consider thepossibility that additional subtypes (or serotypes) are present amongeven well-established and extensively characterized serogroups. Forinstance, one may need to consider the possible presence of subtypesamong serotype 19A because two chemical structures for the 19A capsularPS have been reported. Kamerling, Pneumococcal polysaccharides: achemical view, in MOL. BIOL. & MECHANISMS OF DISEASE 81-114 (Mary AnnLiebert, Larchmont, 2000). If 19A subtypes are found, their presence mayhelp us explain the rapid increase in the prevalence of serotype “19A”seen after the introduction of the pneumococcal conjugate vaccine. Paiet al., 192 J. Infect. Dis. 1988-95 (2005). In addition, one shouldconsider the possibility that 6C may have arisen recently. Consistentwith this possibility, the genetic studies suggest that the 6C serotypecapsule gene locus is not as diverse (Lin et al., 44 J. Clin. Micro.383-(1988)), as is the 6A locus (Mavroidi et al., 2004). It would beinteresting to investigate the origin and spread of 6C strains bystudying pneumococcal isolates obtained a long time ago (perhaps 50-100years ago).

Currently available pneumococcal vaccines contain only 6B PS because itis presumed to induce cross-protection against 6A. As a part ofpneumococcal vaccine efficacy surveys, all the pneumococcal isolatesfound in the USA are now tested for serotypes 6A and 6B.Cross-protection against 6C may differ, however, from that against 6A.Because 6C and 6B PSs have two structural differences whereas 6A and 6BPSs have only one structural difference, the cross-protection against 6Cmay be inadequate and the currently available pneumococcal vaccines mayreduce the prevalence of 6A but not 6C. In fact, current pneumococcalvaccines may help 6C become more prevalent than before, just as occurredfor serotype 19A. Thus, all pneumococcal isolates should be tested forserotype 6C as well as for serotypes 6A and 6B.

Importantly, the novel serotype 6C provided herein may be useful in avaccine or in pneumococcal vaccine development. For example the 6C PS, aportion of that PS, or a mimetic of the PS or a portion of the PS may beincorporated into a pneumococcal vaccine. Conjugate vaccines comprisingstreptococcal and pneumococcal PS are well-known in the art. See e.g.,U.S. Pat. Nos. 6,248,570; 5,866,135; 5,773,007. PS mimotopes, such asprotein or peptide mimetics of polysaccharide molecules, are alsopossible as alternative antigens or immunogens. See, e.g., Pincus etal., 160. J. Immunol. 293-98 (1998); Shin et al., 168 J. Immunol.6273-78 (2002). Additionally, the proteins or nucleic acids of 6C mayserve as antigens or immunogens in vaccine or vaccine development usingany number of techniques known in the art. See, e.g., U.S. Pat. No.6,936,252. One or more adjuvant agents may be included in such vaccines.The delivery of pneumococcal vaccines, either by parenteral, mucosal, orother administration, and the design, monitoring, and dosing regimens ofsuch vaccines, are well-known in the art.

Additionally, the 6C serotype may be useful in vaccine developmentbecause the bacterium would be used as the target in an opsonization orELISA assays using sera or antibodies raised by test vaccines. Theantigens of the 6C serotype may also be used to raise antibodies thatmight be used for passive protection. Such methods are also well-knownin the art.

The 6C serotype is also useful to monitor vaccine efficacy: The 6Aα and6C serotypes must be distinguished in epidemiological studies involvingpneumococcal vaccines and in studies of pneumococcal vaccine efficacy.For example, if a pneumococcal vaccine is effective against 6Aα but not6C, the vaccine may not be effective in areas where the 6Aβ serotype isprevalent. This would be the case because pneumococcal vaccines elicitantibodies opsonizing 6Aβ only occasionally. Also, usage of conventionalpneumococcal vaccines may well alter the prevalence of 6C: theprevalence of 6C may increase although the prevalence of 6Aα decreases.Preliminary data shown below suggests that 6C prevalence is unchangedwhereas 6Aα prevalence has decreased with the use of conjugate vaccinessince 2000. Without distinguishing between the serotypes, it may bedifficult to deploy a vaccine or assess its efficacy. At present, thenew serotype can be identified by the antibodies as disclosed herein,but additional genetic and biochemical tests may be devised and areenvisioned by the present invention.

Moreover, the prevalence of the 6C serotype should be monitoredglobally, providing valuable information on the emergence of newpneumococci in areas with and without pneumococcal vaccine distribution.The 6C serotype has also been identified in Brazil, Canada, China,Korea, Mexico, and the United States.

To that end, the monoclonal antibodies of the present invention areuseful in identifying the 6Aβ serotype. To wit, the 6A serotype (both6Aα and 6Aβ), are identified by the mAb Hyp6AG1, but 6C serotype doesnot react with the mAb Hyp6AM6 or mAb Hyp6 Am3. Hence, Hyp6AM6 orHyp6AM3 may be used as a negative control from which 6Aα and 6C can beidentified.

Using these monoclonal antibodies, the prevalence of 6A and 6Aβ (6C)among the United States pneumococcal isolates submitted to the CDC wereanalyzed. Approximately the same number of pneumococcal isolates weresubmitted to the CDC from 1999 to 2006. Specimens typed as “6A” by theold method were reanalyzed using the monoclonal antibodies describedherein. Almost all the “6A” specimens received in 1999, 2003, and 2004were reanalyzed. Only a fraction of the samples the CDC received in 2005and 2006 were reanalyzed. As seen in the table, the prevalence of6A(6Aα) decreased but the prevalence of 6C remained the same. Thissuggests that the currently available pneumococcal vaccine may not beeffective against 6C.

1999 2003 2004 2005 2006 All ages 6A 169 132 51 16 16 6C 41 40 57 21 23

Additionally, the identification of 6C provides for the production andisolation of anti-6C antibodies. Also, its identification allows one toproduce reagents specific for 6A(6Aα) as shown by the conventional“6A”-specific reagents recognizing both 6A(6Aα) and 6C(6Aβ). These Canbe prepared by conventional means well known in the art in light of thecurrent specification. In this regarding anti-6C antibodies includesboth intact immunoglobulin molecules as well as portions, fragments,peptides and derivatives thereof, such as, for example, Fab, Fab′,F(ab′)₂, Fv, CDR regions, or any portion or peptide sequence of anantibody that is capable of binding a 6C antigen, epitope, or mimotope,all of which may also be referred to as an “antigen binding protein.” Anantibody or antigen binding protein is said to be “capable of binding” amolecule if it is capable of specifically reacting with the molecule tothereby bind the molecule to the antibody or antigen binding protein.See, e.g., WO/US2006/014720; WO/US2006/015373.

Serotype 6C has been deposited with the American Type Culture Collectionin accord with the Budapest Treaty.

The invention will now be described further by non-limiting examples.

EXAMPLES Example 1 Identification of Pneumococcal Serotypes

Collection of pneumococcal lysates: The pneumococci serotype 6Aβ (SeeLin et al., 44 J. Clin. Micro. 383-88 (2006)) was isolated in a blindedstudy using 495 clinical isolates: Fifty isolates were from Mexico, 100from Denmark, and 345 from Brazil. Twenty-two isolates were fromasymptomatic carriers of pneumococci in the nasopharynx and 475 isolateswere from patients with invasive pneumococcal infections such asmeningitis and sepsis. In addition, control pneumococcal strainsexpressing serotypes 11A, 11B, 11C, 11D, and 11F were purchased fromStatens Serum Institut (Copenhagen, Denmark).

Lysates of the clinical isolates were prepared in the country of origin.Three hundred microliters of Todd-Hewitt medium with 0.5% yeast extract(THY medium) was inoculated with a single colony of pneumococci. Afteran overnight incubation at 37° C., cells were lysed with 0 μl of lysingsolution (0.2% sodium deoxycholate, 0.02% SDS, 0.1% sodium azide, 0.3 Msodium citrate, pH 7.8). In Brazil, 400 μl of THY medium was used forbacterial growth and 100 μl was removed to store the bacteria frozenbefore mixing the remaining 300 μl with 50 μl of lysing solution. InDenmark, 325 μl of THY medium and 25 μl of lysing solution were used.Bacteria were lysed by incubating the mixture at 37° C. The lysates werecoded and shipped to the University of Alabama at Birmingham (UAB)laboratory for serotype testing by regular mail at ambient temperature.

To simplify the shipping of bacterial lysates from distant sites to UABfor the multibead assay, the stability of bacterial lysates was comparedafter storage at room temperature (RT) or 37° C. The work revealed thatbacterial lysates can be stored at RT for up to one month or at 37° C.for several days without affecting the results of the multibead assay.Thus, the regular postal mail system was used to ship all the lysates inthis study at ambient temperature without any thermal protection.

Serological Reagents: All the polyclonal serotyping sera were made inrabbits and were obtained from Statens Serum Institut. They includetwelve serum pools for serogrouping and various type- or factor-specificantisera. Sorensen, 31 J. Clin. Microbiol. 2097-2100 (1993). All themAbs were produced as described, and hybridoma culture supernatants wereused. Yu et al., 2005.

Multibead assay: This assay was performed as described using twodifferent sets of latex beads. Yu et al., 2005. One set of beads (Set 1)was a mixture of fourteen different latex beads, each coated with onepneumococcal PS antigen. The fourteen pneumococcal PS antigens wereserotypes 1, 3, 4, 5, 6A, 6B, 7F, 9N, 9V, 14; 18C, 19A, 19F, and 23F.Bead Set 2 was created by coating each of ten bead types with one of tendifferent pneumococcal PS (serotypes 2, 8, 10A, 11A, 12F, 15B, 17F, 20,22F, and 33F).

Set 1 beads were mixed with either 5× or 20× diluted bacterial lysateand a mixture of mAbs specific for the pneumococcal capsular PScontained on the beads. After incubation and washing, the bead mixturewas reacted with fluorescein-conjugated anti-mouse immunoglobulinantibody. Set 2 beads were used the same as Set 1 beads except that amixture of polyclonal rabbit antisera (Statens Serum Institut) andfluorescein-conjugated anti-rabbit immunoglobulin antibody were used.After incubation, the amount of fluorescence of each bead type wasdetermined with a flow cytometer (FACSCalibur, Beckton Dickinson, SanJose Calif.). The fluorescence of each bead type was then used todetermine its serotype. Fluorescence inhibitions greater than 67% wereused as positives.

Neufeld's test: This assay was performed as described (Henrichsen, 33 J.Clin. Microbiol. 2759-62 (1995); Konradsen, 23 Vaccine 1368-73 (2005);Lund, 23 Bull. Wld Hlth Org. 5-13. (1960)) by the reference laboratoriesin Denmark, Brazil, and Mexico using standard serogrouping (Sorensen,1993) and serotyping rabbit antisera from Statens Serum Institut.

Dot blot assay: To investigate discrepant results, this assay wasperformed as described (Fenoll et al., 35 J. Clin. Micro. 764-76(1997)), using pneumococcal antisera from Statens Serum Institut to thefollowing serogroups or serotypes: 1, 4, 5, 6, 7, 8, 9, 11, 12, 14, 18,and 23. Monoclonal antibodies specific for 6A (Hyp6AM3) and 18C(Hyp18CM1) were also used in some cases. Briefly, heat-killedpneumococci grown in THY medium were spotted on strips of nitrocellulosemembranes. After drying, the strips were blocked and washed. Strips werethen incubated in a diluted antiserum or mAb solution for 1 hour, washedand exposed to a diluted goat anti-rabbit or mouseimmunoglobulin-peroxidase conjugate. After one hour incubation at roomtemperature, the strips were washed and exposed to 3-amino-9ethylcarbazole solution. When the spots appeared, the strips were washedand evaluated.

PCR reactions: Pneumococci were grown in THY medium to an OD of 0.8 at650 nm wavelength. Chromosomal DNA was prepared using the InvitrogenEASY-DNA kit and following the given instructions, beginning with a 4 mlsample of the THY-grown pneumococci concentrated to 1 ml (Invitrogen,Carlsbad, Calif.). For serogroup 6 determination, PCR was performedusing chromosomal DNA as template and primers wciP-up, 5′-ATG GTG AGAGAT ATT TGT CAC-3′ (SEQ ID NO:3) and wciP-down, 5′-AGC ATG ATG GTA TATAAG CC-3′ (SEQ ID NO:4). PCR thermocycling conditions were as describedin Mavroidi et al., 2004. A Qiagen PCR cleanup column (Qiagen, Valencia,Calif.) was used to remove excess primer from the PCR reactions and thePCR was submitted as DNA template for automated DNA sequencing using thewciP-up primer. Results were analyzed with the aid of the Sequencher(GeneCodes, Inc., Ann Arbor, Mich.) and the MacVector Sequence Analysis(Accelyrs, San Diego, Calif.).

For serotype 11A determination, PCR for a part of the capsule gene locuswas performed as described (Mavroidi et al., 2004), using chromosomalDNA as the template, 1 μl of forward primer (50 pmol), and 1 μl ofreverse primer (50 pmol). Primers were 11A forward, 5′-GGA CAT GTT CAGGTG ATT TCC CAA TAT AGT G-3′ (SEQ ID NO:5) and 11A reverse 5′-GAT TATGAG TGT AAT TTA TTC CAA CTT CTC CC-3′ (SEQ ID NO:6). PCR cycling beganwith 94° C. for 5 minutes, followed by 30 cycles of 94° C. for 1minutes, 50° C. for 1 minute and 72° C. for 2 minutes, followed by afinal extension of 72° C. for 10 minutes. The PCR reaction products wereanalyzed by agarose gel electrophoresis (Tris-acetate buffer 0.8%agarose) to determine the amplicon size.

Study of fifty isolates from Mexico: The fifty isolates from Mexico weregrown in THY medium, lysed, and sent to UAB for typing. When themultibead assay results were compared with the Neufeld's test results,results from ten samples were discrepant. When new lysates of eight ofthe discrepant samples were obtained and re-examined in a blind fashion,all results matched, suggesting that the discrepancies were largely dueto mislabeling. Two isolates (MX24 and MX37) that were typed to beserotype 3 and 10A by the Neufeld's test were originally typed asnon-typeable (NT) by the multibead assay. Because the two serotypesshould have been identified by the multibead assay, the two bacterialisolates were sent to the UAB laboratory for further study. There theywere found to grow well in THY medium, with the new lysates producingresults matching the Neufeld's test results. Thus, the two isolates wereinitially falsely identified as negatives by multibead assay, mostlikely due to insufficient growth of pneumococci.

Study of 100 isolates from Denmark: When the multibead assay results ofone hundred Denmark isolates were compared with the Neufeld's testresults, we found four errors in transcribing the Neufeld's test resultsand one strain (DK94) was typed as serotype 20 by the Neufeld's test andas NT by the multibead assay (Table 1).

TABLE 1 Serotyping results with both serotyping assays and the finalresults after the investigations. Serotype^(#) Multibead Neufeld Finalresults  1 30 30 30  2  1  1 1  3 22 22 22  4 20 20 20  5 11 11 11  6A 16* 21 21  6B 24 24 24  7F/A 14 14 14  8 13 13 13  9V 18 18 18  9N/L 1212 12 10A/B/39/33C 12 12 12 11A/D/F  8* 10 9 12F/A/B 16 16 16 14 52 5252 15B/C 10 10 10 17F  6  6 6 18C 28  27* 28 19A 18 18 18 19F 26 26 2620   3**  4 4 22F/A  6  6 6 23F 19 19 19 33F/A  6  6 6 NT 104  97 97Total 495  495  495 ^(#)NT indicates non-typeable serotypes by themultibead assay. 7F/A means that the isolate may express either 7F or 7Aserotypes. 10A/B/39/33C indicates that the isolate may express serotype10A, 10B, 39 or 33C. *After additional studies of Brazilian isolates, itwas concluded that the multibead assay failed to identify five 6Astrains (with Hyp6AM3) and one 11A strain, and that Neufeld's testfailed to identify one 18C strain and falsely identified one strain as11A. **One Danish strain had high background signal and was not detectedduring the initial multibead assay.

When the DK94 isolate was re-grown in THY and re-examined, it producedalmost no inhibition (9%) at a 1:5 dilution, but it produced moreinhibition at higher dilutions (35% at a 1:20 dilution and 50% at a1:320 dilution). This unexpected behavior suggested the presence ofnon-specific binding material in the lysate of this specific isolate.When the PS in the lysate was precipitated with 70% ethanol and theethanol precipitate was examined with the multibead assay, theprecipitate produced a clear inhibition for serotype 20 (86% at a 1:5dilution and 81% at a 1:20 dilution). Thus, the initial discrepancy wasdue to non-specific binding, which was occasionally observed with theassays performed with polyclonal sera, and there is no intrinsic problemin assay sensitivity and specificity with clinical isolates.

Study of 345 samples from Brazil: When the results of 345 Brazilianisolates obtained with the two assay methods were compared, there werethirty-eight mismatches. When these thirty-eight samples werere-examined by investigating test records and retesting by Neufeld'stest in Brazil, seventeen of the mismatches could be explained as typingmistakes or sample misidentification. One of the seventeen mismatcheswas strain BZ652. This was initially typed as 18B, but was determined tobe 6A because it was typed as weakly 6A by Neufeld's test and was typedas serogroup 6 by the dot blot assay using the polyclonal antisera andmAb Hyp6AM3. When the twenty-one remaining mismatched samples wereregrown in THY medium and retested by multibead assay, the new resultsof thirteen isolates matched the Neufeld's test results. When theoriginal multibead assay results of the thirteen isolates werere-examined, three isolates produced weak and incomplete inhibitions(inhibitions were less than 67%) for the appropriate serotype in theoriginal multibead. Although twelve isolates were initially typed as NT,one isolate (BZ52) was initially typed as type 3. It was retyped as NTwith the second sample and the result became consistent with theNeufeld's test result (Table 1) (above).

After these re-examinations, eight discrepancies were reproducible andstill unexplained (Table 2 and Table 3): five isolates were typed as 6Aby the Neufeld's test but as NT by the multibead assay, two isolates(BZ435 and BZ705) were typed as 11A by the Neufeld's test but as NT bythe multibead assay, and one isolate (BZ438) was typed as NT by theNeufeld's test but as 18C by the multibead assay. By the Neufeld's test,BZ438 did not react with several lots of serogrouping Poolsera A and Q(Sorensen, 1993), which should react with serogroup 18 pneumococci. Italso did not react with several different lots of antisera specific forserogroup 18 or specific for factors 18c, 18d, 18e, and 18f. BZ438produced positive dot blot results, however, with a serogroup18-specific polyclonal serum or with mAb Hyp18CM1 (Yu et al., 2005).Thus, the BZ438 isolate was considered to be 18C.

Strains BZ435 and BZ705 were considered to be 11A by the Neufeld's testbut not 11A, 11D, or 11F by the multibead assay. Because the standardmultibead assay uses a polyclonal antiserum against serogroup 11 (Yu etal., 2005), we examined the two strains with two mAbs.(Hyp11AM1 andHyp11AM2) that are specific for serotypes 11A, 11D, and 11F and thatwere recently produced in the UAB laboratory (Table 2). We found thatHyp11AM1 recognizes BZ435 but not BZ705. Interestingly, Hyp11AM2recognized neither strain, suggesting heterogeneity among the strainsexpressing the 11A serotype. A PCR test produces 463 base pair amplimerswith strains for 11A, 11D, and 11F but not for 11B and 11C (Table 2).When both strains were tested by this PCR, BZ435 was positive, but BZ705was not. Although the Neufeld's test showed that both strains reactedwith antisera specific for factor 11c, the Neufeld's test also revealeddifferences between them: BZ435 but not BZ705 reacted with Poolserum T(Sorensen, 1993), with serogroup 11 antisera, or with 11f factor serum.BZ705 yielded ambiguous results for factor 11b expression and thissuggested that it could be serotype 11D. In a dot blot test forserogroup 11 using rabbit typing serum, however, BZ435 was positive butthat BZ705 was negative. Considering all of these results, it appearedthat BZ435 is an 11A strain and that BZ705 is not 11A, 11D, or 11F.BZ705 may belong to the 11C serotype since BZ705 expresses the 11cepitope (and reacts with 11c antisera) that is not expressed on 11Bstrains.

TABLE 2 Studies of two strains for the 11A serotype with Neufeld,multibead, PCR, and dot blot assays Multibead assay Neufeld's test withWith With With Dot blot assay Strains rabbit sera^(#) rabbit sera^(#)Hyp11AM1 Hyp11AM2 PCR with rabbit sera^(#) BZ435 11A − + − + + BZ705 11A* − − − − − Control Not tested + + + + Not tested Strain 11A ControlNot tested − − − − Not tested Strain 11B Control Not tested − − − − Nottested Strain 11C Control Not tested + + + + Not tested Strain 11DControl Not tested + + + + Not tested Strain 11F ^(#)All the rabbit serawere from Statens Serum Institut (Denmark). *In the Neufeld's test,BZ705 did not react with Poolserum T (25) and factor serum 11f, but itdid react strongly with factor serum 11c and ambiguously with factorserum 11b.

To investigate the remaining discrepant strains that were 6A, the DNAsequence of the wciP gene was examined. A recent study reported that thecapsular PS of 6A and 6B has repeating units with rhamnose linked toribitol. The linkage is 1-3 for 6A and 1-4 for 6B. The study found thatthe rhamnosyl transferase is likely encoded by the wciP gene in thecapsule locus, that wciP for 6A encodes serine at residue 195, and thatwciP for 6B encodes asparagine at residue 195 (Mavroidi et al., 2004).Also, wciP alleles 1, 2, 7, 9, and 11 are exclusively associated withserotype 6A, and alleles 3, 4, 5, 6, 8, and 12 are associated withserotype 6B. (Mavroidi et al., 2004).

Bacterial DNA was obtained from the five isolates labeled 6A as well asBZ652, which was considered to be only weakly 6A by the Neufeld's test.This DNA was amplified a part of the wciP gene by PCR, sequenced theamplicon, and examined the sequence (645 base pairs). Five samples wereamplified successfully, and their sequences were consistent with a 6Aserotype because they expressed alleles associated with the 6A serotype(Table 3) and expressed serine at amino acid residue 195. Compared tothe prototypic sequence of allele 2 wciP, the wciP sequence of BZ652 hasfive base pair changes with three potential amino acid replacements.Three isolates (BZ17, BZ39, and BZ86) express the identical wciP genesequence with one identical nucleotide variation from the prototypicsequence for allele #9 and may, therefore, be clonally related (Table3).

TABLE 3 Studies of six strains for 6A serotype with Neufeld, multibead,and PCR assays. Neufeld's test with PCR for Multibead assay polyclonalwciP Polyclonal Names antisera allele* Hyp6AM3 sera Hyp6AG1 BZ17 6A #9(1) NT 6A 6A BZ39 6A #9 (1) NT 6A 6A BZ86 6A #9 (1) NT 6A 6A BZ650 6A #1NT 6A 6A BZ652^(#) NT (6A)^($) #2 (5) 6A 6A 6A BZ1048 6A Not NT 6A 6Adone *The number in parentheses indicates the number of base pairsdifferent from the proband sequence (Mavroidi et al., 2004). BZ652 hasfive base pair differences that produce three amino acid differences.All these alleles express serine at amino acid residue 195. ^($)BZ652was initially typed as NT (non-typeable) but was typed as weakly 6A onre-examination.

Because the DNA study suggested that these isolates may belong to the 6Aserotype, these isolates were examined with the multibead assays usingpolyclonal antisera. All six isolates were typed as 6A (Table 3). Whenthey were typed with nineteen different 6A-specific mAbs in addition toHyp6AM3, one mAb (Hyp6AG1) identified the six isolates as 6A (Table 3).When Hyp6AG1 was used to retest forty-six 6A isolates (twenty-one fromthis study and twenty-four in the UAB laboratory collection), it wasfound that this mAb identified all of them as 6A and that it did notrecognize any of the eighty-nine non-6A serotypes, including theforty-three isolates expressing the 6B serotype. Thus, it was clear thatall these six isolates are 6A and that Hyp6AG1 recognizes all 6Aisolates. Also, mAb Hyp6AM3 recognizes a subset of 6A isolates, althoughthat subset is very large.

Example 2 Pneumococcal Serotype 6Aβ Isolates from Different Countrieshave the Molecular Characteristics Associated with 6A

As described above, Brazilian isolates that did not react with both mAbspreviously associated with serotype 6A were shown to belong to the 6Aserotype by examining the wciP allele. Thus, the inventors examined wciPgene of ten 6Aβ isolates from geographically diverse locations.Brazilian isolates collected in 2003 and in 2004, USA isolates and oneisolate from Korea were examined. The sequences clearly showed that allten isolates have the genetic characteristics associated with 6Aserotype.

Example 3 6Aβ Isolates from Different Areas have Uniform SerologicalCharacteristics

To investigate serological characteristics of the 6Aβ isolates in aquantitative manner, isolates were examined using an inhibition assay.The assay measured inhibition by bacterial lysates of various anti-6Aantibodies binding to 6A PS-coated ELISA plates. Briefly, the wells ofELISA plates (Corning Costar Corp., Acton, Mass.) were coated at 37° C.with 5 μg/mL of 6A capsular PS (a gift of G. Schiffman, Brooklyn, N.Y.)overnight in PBS. After washing the plates with PBS containing 0.05% ofTween 20, previously diluted bacterial culture supernatant (or lysates)was added to the wells along with an anti-6A antibody. Pneumococcallystates were prepared by growing pneumococci in 10 mL of Todd-Hewlettbroth supplemented with 0.5% yeast extract (THY) without shaking untilthe tubes became turbid and then incubating the tubes for 15 minutes at37° C. with a lysis buffer (0.1% sodium deoxycholate, 0.01% SDS, and0.15M sodium citrate in deionized water). Hyp6AG1 mAb was used at a1:250 dilution, and Hyp6AN3 mAb was used at 1:100 dilution. Pool Q andfactor “6b” rabbit antisera from Staten Serum Institute (Copenhagen, DK)were used at a 1:500 dilution. After thirty minutes of incubation in ahumid incubator at 37° C., the plates were washed and incubated for twohours with alkaline phosphatase-conjugated goat anti-mouse Ig (Sigma,St. Louis, Mo.) or alkaline phosphatase-conjugated-goat anti-rabbit Ig(Biosource, Camarillo, Calif.). The amount of the enzyme immobilized tothe wells was determined with paranitrophenyl phosphate substrate(Sigma) in diethanolamine buffer. The optical density at 405 nm was readwith a microplate reader (BioTek Instruments Inc. Winooski, Vt.).

Because the qualitative nature of the quellung reaction may haveprevented detection of 6A subtypes, it was determined whether thesubtypes might be distinguishable with a quantitative assay using therabbit sera used for quellung reactions. This was determined by adaptingthe rabbit sera to an inhibition assay, in which pneumococcal lysateswere allowed to inhibit the binding of rabbit antisera to 6A PSimmobilized on ELISA plates (FIG. 1). As a control, pneumococcal lysateswere tested for inhibition of the two mAbs: Hyp6AG1 and Hyp6 AM3 (FIGS.1A and 1B). Lysates of three 6Aα isolates (CHPA378 from the U.S.A., KK58from Korea, and ST558 from Brazil) inhibited both mAbs, and lysates oftwo 6β isolates (strains ST400 and ST518 from Brazil) inhibited neithermAb (FIGS. 1A and 1B). Two lysates of 6Aβ isolates (strains BZ17 andBZ650 from Brazil) clearly inhibited the binding of Hyp6AG1, even at a1:1000 dilution (FIG. 1A). They showed almost no inhibition, however, ofHyp6AM3, even at a 1:10 dilution (FIG. 1B).

When the pneumococcal lysates were examined for inhibiting Pool Q (arabbit antiserum Often used for serotyping (Sorensen, 31 J. Clin.Microbiol. 2097-2100 (1993)), both lysates of 6Aα and 6Aβ could inhibitequally well, but the 6B lysates could not inhibit (FIG. 1C). When a“6b”-factor-specific rabbit serum was tested, all 6Aα, 6Aβ, and 6Cisolates could inhibit the factor serum equally well (FIG. 1D). Becausethe 6b-factor serum is designed to be 6A-specific, this was unexpected.The factor serum is designed to be specific in quellung reactions,however, not in this inhibition assay. Nevertheless, this experimentshowed that rabbit antisera commonly used for pneumococcal typing do notdistinguish between the 6A and 6C subtypes.

The inventors also compared various 6Aα, 6Aβ, and 6B isolates using anopsonization assay and a human serum with a high level of anti-6Bantibodies. Although the human serum opsonized 6B as well as 6Aα (FIG.2), it did not opsonize seven different 6Aβ isolates from Brazil, Korea,and the United States (FIG. 2).

Example 4 Human Antisera are not Equally Protective Against the Two 6ASubtypes

Because a human antiserum can opsonize 6Aα and 6B but not 6Aβ, theinventors systematically examined serum samples from twenty adults foropsonizing 6Aα, 6B, and 6Aβ serotypes (FIG. 3A). None of the serumdonors were vaccinated with a pneumococcal vaccine at least for 5 years.Although most individuals have low opsonization titers, fourindividuals, had opsonization titers greater than 100 for serotype 6B.Sera from the four individuals had significant opsonization titersagainst 6Aα, but only one had a significant titer against 6Aβ. Theobservation suggests that the adult population has less natural immunityagainst 6Aβ than against 6Aα.

To examine whether immunization with 6B induces antibodiescross-reacting with 6Aβ, the inventors studied sera from twenty adultswho were immunized with a pneumococcal vaccine (FIG. 3B). Ten wereimmunized with a 9-valent pneumococcal conjugate vaccine (PCV) and tenwere with a 23-valent PS vaccine (PPV). Eight of the ten personsimmunized with PCV had a high (>100) opsonization titer for 6B. Of theseeight, seven persons had an opsonization titer against 6Aα commensuratewith 6B titer but only one person had 6Aβ titer commensurate with 6Btiter. Because the person's serum opsonized 6Aα almost as well as 6B, itis likely that the elicited anti-6B antibodies that were cross-reactingwith 6Aβ. When PPV vaccinees were examined, five persons had a highopsonization titer (>100) against 6B, two persons had a high titeragainst 6Aα, but none had a high titer against 6Aβ (FIG. 3B). Takentogether, these findings suggest that currently available pneumococcalvaccines may provide protection against 6Aα better than against 6Aβinfections.

Example 5 Development of Monoclonal Antibodies Useful to IdentifyPneumococci

Mouse hybridomas were produced as described previously. Yu et al., 2005(citing Sun et al., 69 Infect. Immun. 336-44 (2001). Briefly, BALB/cmice were immunized twice subcutaneously with PS-protein conjugate (days0 and 21) and once intraperitoneally on day 59. The immunogen for sevenserotypes (4, 6B, 9V, 14, 18C, 19F and 23F) was Prevnar (Wyeth LederleVaccines, Pearl River, N.Y.). Conjugates used for serotypes 5 and 7Fwere prepared at the U.S. Food and Drug Administration (Bethesda, Md.),the 6A conjugate was a gift of Porter Anderson (Rochester, N.Y.), andconjugates of serotypes 1, 3, and 9N to ovalbumin were prepared asfollows. Cyanogen bromide-activated PS was coupled to ovalbumin duringan overnight incubation. The PS-protein conjugate was purified from thereaction mixture with a molecular weight sizing column. Each dosecontained 1 μg of PS for serotypes 4, 9V, 18C, 19F, and 23F; 2 μg forserotypes 3 and 6B; and 10 μg for serotypes 1, 5, 6A, 7F, and 9N. Theprimary and secondary immunogens contained 10 μg of Quil A (SigmaChemical, St. Louis, Mo.).

Three days after the last immunization, the mice were sacrificed, thespleens harvested, and the splenocytes fused with SP2/0 Ag-14 asdescribed previously. Nahm et al., 129 J. Immunol. 1513-18 (1982).Primary culture wells were screened for the production of desirableantibodies, and the wells producing such antibodies were cloned twice bylimiting dilution. A human-mouse hybridoma, Dob9, was produced byhybridizing peripheral blood lymphocytes from a person immunized with a23-valent PB vaccine, as described previously. Sun et al., 67 Infect.Immun. 1172-79 (1999).

The human-mouse hybridoma is specific for pneumococcal serotypes 19A and19F. All hybridomas produced either IgM or IgG antibodies, excepting oneIgA producer. Hyp6AG1 is IgG and Hyp6AM6 is IgM.

A total of twenty-one hybridomas specific for 6A serotypes wereisolated. Many have similar serological behavior and some may be sisterclones (i.e., some may have the identical variable region structure).Names of 6A-specific hybridomas produced are Hyp6A1, Hyp6AM1, Hyp6AM2,Hyp6AM3, Hyp6AM4, Hyp6AM5, Hyp6AM6, Hyp6AM7, Hyp6AM8, Hyp6AM9, Hyp6AM10,Hyp6AM11, Hyp6AM12, Hyp6AM13, Hyp6AG1, Hyp6AG2, Hyp6AG3, Hyp6AG4,Hyp6AG5, Hyp6AG6, Hyp6AG7.

Example 6 Genetic Study of 6Aβ

A non-capsulated pneumococcal strain could be easily transformed withgenes from a 6Aβ isolate (unpublished data). This finding suggests that6Aβ capsule synthesis requires one (not multiple) gene fragment, mostlikely the capsule gene locus. To identify the gene(s) responsible for6Aβ expression, the inventors examined three transferases (wciN, wciO,and wciP). The wciP gene may be identical between 6Aα and 6Aβ isolates(as discussed above). When wciN region was examined by PCR using primers5016 and 3101 (5106: 5′-TAC CAT GCA GGG TGG AAT GT (SEQ ID NO:1) and3101: 5′-CCA TCC TTC GAG TAT TGC (SEQ ID NO:2)), all nine 6Aα isolatesexamined yielded about 200 base pair (bp) longer product than all six6Aβ isolates examined did (FIG. 5). The six isolates included 6Aβisolates from Korea, USA, and Brazil. Thus, this PCR can be used as agenetic test for 6A subtypes.

The nucleotide sequences of the PCR products from one 6Aα isolate (AAU9)and one 6Aβ isolate (ST745) were then determined (FIG. 6). All the basesbetween positions 1203 to 2959 (1757 bases) in AAU9 PCR product weresequenced and the sequence was found to be homologous to CR931638, whichis the capsule locus sequence of a 6A isolate reported in the GenBankdatabase. By contrast, the ST745 sequence was found to be almostidentical to that of 6Aα up to position 1368, and then again startingfrom position 2591. The intervening 1029 by sequence (from 1369 to 2397)is quite different from that of 6Aα. The intervening sequence containsabout 98 by that is similar to a transferase used for polysaccharidesynthesis by Streptococcus thermophilus.

Example 7 The 6C Isolates have Chemically Distinct Capsules

Two 6C isolates (BZ17 and BZ650), four 6A strains (SP85, ST558, andCHPA378), and two 6B strains (ST400 and ST518) were compared. Allpneumococcal isolates had colony morphologies typical of pneumococci,and were both optochin-sensitive and bile-soluble. Subtyping assays wereconducted as described in Example 3, above.

Polysaccharide isolation and purification: A pneumococcal strain (SP85or BZ17) was grown in two liters of a chemically defined medium (van deRijn et al., 27 Infect. Immun. 444-49 (1980)) from JRH Biosciences(Lenexa, Kans.), which was supplemented with choline chloride, sodiumbicarbonate and cysteine-HCl, and lysed with 0.05% deoxycholate. Afterremoving cell debris by centrifugation, PS was precipitated in 70%ethanol and was recovered by dissolving it in 120 mL of 0.2 M NaCl.After dialyzing the PS in 10 mM Tris-HCl (pH 7.4), the PS was loadedonto a DEAE-Sepharose (Amersham Biosciences, Uppsala, Sweden) column (50ml) and eluted with a NaCl concentration gradient. The resultingfractions were tested for 6Aα or 6Aβ PS with the inhibition assaydescribed above. The PS-containing fractions were pooled, concentratedby ethanol precipitation (70%), dialyzed, and lyophilized. Thelyophilized PS was dissolved in 3 ml of water and loaded onto a gelfiltration column containing 120 ml of Sephacryl S-300 HR (AmershamBiosciences). The PS was eluted from the column with water and all thefractions were tested for 6Aβ PS with the inhibition assay. Thefractions containing the first 6Aα or 6Aβ PS peak were pooled andlyophilized.

Monosaccharide analysis: The lyophilized capsular PS was subjected tomethanolysis in 1.5 M HCl at 80° C. for 16 hr. After evaporating themethanolic HCl, the residue was treated with Tri-Sil reagent (PierceBiotech. Inc. Rockford, Ill.) for 20 min at room temperature. Thereaction products were analyzed on a GLC/MS (Varian 4000, Varian Inc.Palo Alto, Calif.) fitted with a 30 m (0.25 mm in diameter) VF-5capillary column. Column temperate was maintained at 100° C. for 5 mM,and then increased to 275° C. at 20° C./min, and finally held at 275° C.for 5 min. The effluent was analyzed by mass spectrometry using theelectron impact ionization mode.

Oxidation, reduction, and hydrolysis: Capsular PS (1 mg/mL) was treatedwith 40 mM sodium periodate in 80 mM sodium acetate buffer (pH=4) forfour days at 4° C. in the dark. After neutralizing the excess periodatewith ethylene glycol, the sample was dialyzed and lyophilized. Stroop etal., 337 Carbohydr. Res. 335-44 (2002). The PS (1 mg/mL) was reducedwith 200 mg/mL of sodium borohydride (NaBH₄) or its deuterium form(NaBD₄) for three hours at RT, dialyzed, and lyophilized. Theoxidized/reduced 6C PS was hydrolyzed in 0.01 M NaOH at 85° C. forthirty minutes, neutralized by adding 0.01 M HCl, and then directly usedfor mass spectrometry without desalting.

Tandem mass spectrometry: The tandem mass spectral analysis of nativeand oxidized/reduced 6C were performed in the Mass Spectrometry SharedFacility at the University of Alabama at Birmingham with MicromassQ-TOF2 mass spectrometer (Micromass Ltd. Manchester, UK) equipped withan electrospray ion source. The samples, dissolved in distilled water,were injected into the mass spectrometer along with running buffer(50/50 acetonitrile/water containing 0.1% formic acid) at the rate of 1μL/min using a Harvard syringe pump. The injected sample was negativelyionized with electrospray (needle voltage=2.8 kV) and detected with aTOF mass spectrometer. The injected sample was negatively ionized withelectrospray (needle voltage=2.8 kV) and detected with a TOF massspectrometer. For MS/MS, the parent ion was fragmented into daughterions by energizing it to 40 eV before collision with argon gas. Thedaughter ions were analyzed with a TOF mass spectrometer. The MS/MSspectra were processed using the Max-Ent3 module of MassLynx 3.5.

Smith degradation and glycerol detection: Periodate treated 6Aα and 6AβPSs were reduced with 10 mg/ml Sodium borodeuteride in 1M ammoniumhydroxide for 16 hr. Excess sodium borodeuteride was removed by additionof glacial acetic acid and 0.5 ml of methanol:acetic acid (9:1) wasadded. Samples were dried under a stream of nitrogen and washed twicewith 0.25 ml of methanol. Dried samples were suspended in 0:5 ml of 1.5Mmethanolic HCl and incubated at 80° C. for 16 hr. Samples were driedunder a stream of nitrogen and washed twice with 0.25 ml of methanol.Dried samples were suspended in 0.1 ml of Tri-Sil (Pierce) and incubatedat 80° C. for 20 min. The 1 μl of samples were injected into a Varian4000 gas chromatograph mass-spectrometer (Varian 4000, Varian Inc. PaloAlto, Calif.) equipped with a 60 m VF-1 column. Helium was used as thecarrier gas at a constant flow rate of 1.2 ml/min. The oven conditionswere an initial temperature of 50° C. held for 2 min, temperatureincrease at 30° C./min to 150° C., then another increase at 3° C./min to220° C., which was held for two minutes. The injector temperature waskept at 250° C. and the MS transfer-line at 280° C. MS data acquisitionparameters included scanning from m/z 40 to 1000 in the electron impact(EI) mode or in the chemical ionization (CI) mode using acetonitrile.

The chromatography of 6Aα PS showed all the peaks that arecharacteristic of ribitol, rhamnose, glucose, and galactose (FIG. 7A),consistent with a previous publication. Kim et al., 347 Anal. Biochem.262-74 (2005). For instance, galactose yields three major peaksappearing between 11.2 and 11.6 min retention times with the second peakbeing the tallest. Kim et al. (2005). When 6Aβ PS chromatogram wasexamined, characteristic peaks of ribitol, rhamnose, and glucose werefound but galactose peaks was absent. When the areas of eachcarbohydrate peaks were normalized to rhamnose peak area and comparedbetween 6Aα and 6Aβ (FIG. 7B), 6Aα and 6Aβ PS have the equivalent areasof ribitol peaks. The glucose peak area of 6Aβ, however, was twice ofthat of 6Aα (FIG. 7B). This finding suggested that the repeating unit of6Aβ has one ribitol and one rhamnose as 6Aα but it has two glucosemolecules instead of one each of glucose and galactose molecules. Thus,6Aβ produces a capsular PS that is chemically different from the PSproduced by 6Aα by using glucose instead of galactose.

To further investigate the two glucose molecules presumed to be presentin 6Aβ PS, 6Aα and 6Aβ PS were treated with periodate, which selectivelydestroys vicinal glycols. As expected from the published structure of 6APS, the galactose and ribitol peaks of 6Aα PS became undetectable whilethe glucose and rhamnose peaks were undisturbed. Kamerling, Pneumococcalpolysaccharides: a chemical view, in Mol. Biol. & Mechanisms of Disease81-114 (Mary Ann Liebert, Larchmont, 2000); Kim et al., 347 Anal.Biochem. 262-74 (2005); Rebers & Heidelberger, 83 J. Am. Chem. 3056-59(1961). When 6Aβ PS was periodate-treated, its ribitol becameundetectable and its glucose peak was reduced by about half while itsrhamnose peaks remained undisturbed (FIG. 7B). This finding stronglysuggests that the 6Aα PS structure is identical to the 6A PS structurepublished in the literature. Also, it indicates that 6Aβ PS ischemically different from 6Aα PS and that 6Aβ PS has two glucosemolecules, one of which is sensitive to periodate and the other of whichis not.

Example 8 Determination of Monosaccharide/Ribitol Sequence within theRepeating Units

A mild alkali hydrolysis of 6A PS breaks the phosphodiester bond in eachrepeating unit and produces a repeating unit with a negative charge,which can then be examined with tandem mass spectrometry. The hydrolysisproduct of 6Aα PS (from strain SP85) showed three well-defined peakswith a negative charge: peaks with 683.21, 701.21, and 759.19 mass tocharge ratio (m/z) units (FIG. 8A). The peak at 683.21 m/z unitsrepresents anhydrous form of the peak at 701.21 m/z units and the peakat 759.19 represents the molecule with 701.21 m/z unit with NaCl salt.This indicates that the mass of the repeating unit is 683.21 mass unitsas described. Kamerling, 2000; Kim, 2005. The daughter ions (productions) of the 701.21 peak were examined and yielded daughter ions withmasses of 539.13, 377.08, and 212.99 m/z units, which respectivelycorrespond to the masses of glucose-rhamnose-ribitol-P,rhamnose-ribitol-P, and ribitol-P fragments (FIG. 8C). Also theiranhydrous counterparts at 701.21, 539.14, 377.08 and 212.99 m/z units.Additional peaks observed at 96.94 and 78.93 m/z units represent H₂PO₄ ⁻and PO₃ ⁻ ions (FIG. 8C).

Analysis of 6Aβ PS, using the same procedure used for 6Aα PS, showedthree major peaks at 683.24, 701.25, and 759.22 m/z units, correspondingto the three major peaks found for 6Aα PS (FIG. 8B). Also, the 6Aβcleavage products had a mass spectrum identical to those of 6Aα (FIG.8D). This finding indicated that the mass of the repeating unit of 6AβPS is 683.2 m/z units and that the carbohydrate sequence of the 6Aβrepeating unit is glucose 1-glucose 2-rhamnose-ribitol-P. (Todistinguish between the two glucoses, they are labeled as glucose 1 andglucose 2. Glucose 1 corresponds to the galactose of 6Aα) Thus, themonosaccharide sequence of 6Aβ is identical to that of 6Aα except forthe replacement of galactose with glucose 1.

Example 9 Determination of the Linkages Between Carbohydrate and Ribitolof the 6Aβ Repeating Unit

To identify the 6Aβ glucose that is periodate-sensitive, 6Aβ PS wasoxidized and reduced to repeating units by mild alkali hydrolysis, andthe repeating units studied with tandem mass spectrometry. Their massspectrum showed several major (and dominant) peaks between 650 and 700m/z units (FIG. 9A). The dominant peaks were at 655.23, 659.73, 661.24,664.25, 673.25, and 675.24 m/z units. Due to natural isotopes, eachdominant peak has satellite peaks with one or two additional mass unitsand these satellite peaks can be used to determine the charge states andthe true mass of the dominant peaks. Cole, Electrospray ionization massspectrometry: fundamentals, instrumentation, and applications (Wiley,New York, 2000). For instance, the dominant peak at 661.24 m/z units hasa satellite peak with 661.57 m/z units. Because these two peaks areseparated by 0.33 m/z units, the 661.24 peak represents a molecular ionwith three negative charges and 1983.72 mass units (i.e., threerepeating units with one water molecule; 655.23*2+673.76=1983.72).Similarly, the 664.25 and 675.24 peaks represented two repeating unitswith two negative charges, but the 675.24 peak has a sodium ionreplacing a proton. The 673.25 and 655.23 peaks represent one repeatingunit with one negative charge with or without a water molecule. Becausethe mass of the anhydrous repeating unit prior to oxidation/reductionwas 683.26, the repeating unit lost 28 mass units due to oxidation andreduction. To identify the periodate reaction products of ribitol andglucose, the ribitol fragment was named the Rx fragment and the twoglucose fragments were named the Gx and Gy fragments (FIG. 10A).

Daughter ions were obtained by fragmenting the parent ion with 673.25m/z units (FIG. 9B). During the fragmentation, one fragment may exchangeone atomic mass unit (AMU) with the other fragment. Grossert et al., 20Rapid Commun. Mass. Spectrom. 1511-16 (2006); McLafferty 31 Anal. Chem.82-87 (1959). Also, molecular ions become variably hydrated within argoncollision cells. Sun et al., 69 Infect. Immun. 336-44 (2001.). Indeed,the daughter ions could be grouped into hydrated and anhydrous peaksbased on differences of 18 m/z units (FIG. 9B). The peaks found at673.25, 581.16, 509.13, 347.07, and 200.99 m/z units are hydrated peaks,each of which has a corresponding anhydrous peak that is 18 AMU less.Also, the peaks at 200.99, 347.07 and 509.13 m/z units correspond to thefragments with 200, 346, and 508 AMUs with one hydrogen atom added tothe fragmentation site (FIG. 10B) during the fragmentation. The peak at200.99 m/z unit confirms that ribitol lost CH₂OH during the periodatetreatment. The peaks at 347.07 and 509.13 indicate that rhamnose andglucose 2 are periodate resistant. Presence of a peak at 581.16indicates that glucose 1 is cleaved.

Periodate cleavage divided glucose 1 into two parts (which were named Gxand Gy in FIG. 10A). The combined mass of the two parts is 164 insteadof 162 (mass of intact glucose) because glucose 1 lost no carbon butacquired two hydrogen atoms at the breakage site during the oxidationand reduction reactions. The mass spectrum shown in FIG. 9 is consistentwith Gx and Gy having 91 and 74 AMUs respectively. The peak at 581.16m/z units indicates that a repeating unit lost Gx and one extra proton(FIG. 10). Neutral loss of both Gx and Gy (74 AMUs) results inadditional loss of 72 m/z units because Gy already lost one hydrogen toGx and leaves one hydrogen with glucose 2. The same patterns were foundfor the anhydrous peaks: i.e., 655.22, 563.16, and 491.12 m/z units.Furthermore, when the 6Aβ PS was reduced with NaBD₄, the two additionalmass units were associated with glucose 1: the neutral loss of Gxfragment was 93 instead of 92, and that of Gy was 73 instead of 72 (FIG.4C). These findings clearly indicated that glucose 1 cleaves into Gx andGy with sizes shown in FIG. 10A.

The mass spectrum of daughter ions also provided information about theglycosidic linkages of 6Aβ PS. Glucose and rhamnose must be linked tothe preceding carbohydrate at their first carbon. Rebers & Heidelberger,1961. Also, they must be linked to the succeeding carbohydrate at thethird carbon in order to be resistant to periodate. Rebers &Heidelberger, 1961 Thus, 6Aβ PS must have glucose 1 (1→3) glucose 2(1→3) rhamnose (1→). Further examination of the daughter ions shows thattheir glucose 1 has the phosphodiester bond at its second carbon. To beperiodate sensitive, glucose 1 must have its phosphodiester link only atpositions 2, 4, or 6. The phosphodiester bond linkage is not at position6 because the linkage at 6 results in a loss of a carbon atom in glucose1 (FIG. 10F). If the phosphodiester linkage is at position 4, thebreakage occurs between the second and the third carbon. Gx and Gyshould then have 120 and 42 AMUs, and a peak with 552 m/z units shouldbe detected instead of the peak at 581 m/z unit (FIG. 10E).

Although hydrolysis cleaves the phosphodiester bond with glucose 1, itoccasionally breaks the phosphodiester bond with ribitol instead.Examination of this reverse cleavage products further confirms that thephosphodiester linkage must be at the second carbon of glucose 1. Thepeaks with 150.95 and 243.00 m/z units are reverse cleavage products ofglucose 1 (FIG. 9B) since products with these m/z units can be producedfrom glucose 1 with the phosphodiester bond at the second carbon (FIG.10D) and these peaks have one (150.95→151.97) or two (243.00→245.02)more m/z units if reduction was performed with NaBD₄ instead of NaBH₄(FIG. 9C). An ion at 120.95 m/z units can be also obtained if the ion at150 m/z units loses the terminal methanol group. These peaks cannot beexplained if the phosphodiester bond is at the fourth or the sixthcarbon (FIGS. 10E and 10F). Thus, the data with the reverse cleavageproducts also indicate that the phosphodiester bond is linked to thesecond carbon of glucose 1.

Additional examination of the mass spectra showed that therhamnose-ribitol linkage must be (1→3). Because pneumococci useCDP-5-ribitol that is produced for teichoic acid synthesis for theircapsule synthesis as well (Pereira & Brown 43 Biochem. 11802-12 (2004)),the linkage between ribitol and glucose 1 must be ribitol (5→P→2)glucose 1. The peaks at 78.94 and 96.94 correspond to PO₃ ⁻ and H₂PO₄ ⁻,while the peaks at 182.98 and 200.99 (FIG. 9B) correspond to the Rxfragment attached to PO₃ ⁻ and H₂PO₄ ⁻ (FIG. 10A). Thus, ribitol mustlose a hydroxymethyl group during the oxidation and reduction reactionand the linkage between rhamnose and ribitol must be rhamnose (1→3)ribitol. Considering all of the above, the 6Aβ repeating unit should be{P→2) glucose 1 (1→3) glucose 2 (1→3) rhamnose (1→3) ribitol (5→} (FIG.10C).

When 6Aα PS was analyzed, peaks identical to the 6Aβ PS peaks werefound, which indicate that galactose and ribitol were destroyed byperiodate but that glucose 2 and rhamnose remained intact. Thus, thestructure of 6Aα PS must be {→2) galactose (1→3) glucose 2 (1→3)rhamnose (1→3) ribitol (5→P), which is identical to the 6A PS structurepublished in the literature. Kamerling, 2000; Rebers & Heidelberger,1961. In summary, the only structural difference between 6A and 6C PS isthe orientation of the hydroxyl group at the fourth carbon of glucose 1(or galactose).

Classically, the phosphodiester bond of 6A PS was determined to be atthe second carbon of galactose by demonstrating that glycerol isreleased after a Smith degradation of the 6A PS that was oxidized andreduced. Rebers & Heidelberger, 1961. To confirm the position of the 6Aβphosphodiester bond using this classical approach, the Smith degradationof 6Aα and 6Aβ PSs after oxidation and reduction was performed asdescribed above. The reaction products of 6Aα and 6Aβ PSs indicatedglycerol from the two PSs. Thus, glucose 1 has a phosphodiester bond atthe second carbon of glucose 1.

Example 10 Genetic Origin of Serotype 6C

Bacterial strains and culture: The pneumococcal strains used in thestudy are listed in Table 4:

TABLE 4 List of pneumococcus strains Country of origin Strain namesSerotype Tissue location (year of isolation) Source or reference CHPA376C Nasopharynx USA (1999-2002) (18) CHPA388 6C Nasopharynx USA(1999-2002) (18) BGO-2197 6C Nasopharynx USA (1979) Payne et al (2006)MX-67 CMN 6C Bronchus Mexico (1996) (23) ACA-C21 6C Nasopharynx Canada(1995) (23) BZ17 6C CSF Brazil (2003) (15) BZ39 6C CSF Brazil (2003)(15) BZ86 6C CSF Brazil (2003) (15) BZ650 6C CSF Brazil (2003) (15)ST260 6C CSF Brazil (2003) This study KK177 6C Oropharynx Korea (2005)This study CH66 6C Nasopharynx China (1997) (23) CH158 6C NasopharynxChina (1997) (23) CH199 6C Nasopharynx China (1998) (23) CHPA67 6ANasopharynx USA (1999-2002) (18) CHPA78 6A Nasopharynx USA (1999-2002)(18) BZ652 6A CSF Brazil (2003) (15) KK58 6A Oropharynx Korea (2005)This study AAU-33 6A Blood USA (1998) (17) TIGR4JS4 Non-capsulatedderived from TIGR4* Not applicable (26) TIGR6A4 6A derived from TIGR4JS4Not applicable This study TIGR6AX Non-capsulated derived from TIGR6A4Not applicable This study TIGR6C4 6C derived from TIGR6A4 Not applicableThis study *TIGR4 was originally isolated from blood (25).

In addition to the 6C isolates from Brazil that were reported earlier(Lin et al., 2006), additional 6C strains were identified by retypingthe preexisting pneumococcal isolates archived in the laboratory as the“6A” serotype. The collection includes 6A isolates used for studies byRobinson et al., 184 J. Bacteriol. 6367-75 (2002); Mavvoidi, 2004; andPayne (Payne, 2006 submitted). One strain (BGO-2197) was isolated in1979 in Birmingham, Ala., USA. The TIGR4JS4 strain is a non-capsulatedvariant of the TIGR4 strain (Tettelin et al., 293 Science 498-506(2001)), and was produced by replacing type 4 capsule gene locus withJanus cassette (kan^(R)-rpsL⁺) and backcrossing 3 times to wildtypeTIGR4 (Trzcinski et al., 69 Micorbiol. 7364-70 (2003); Hollingshead(unpublished)). TIGR6AX, TIGR6A4, and TIGR6C4 are TIGR4JS4 variantsexpressing, respectively, no, 6A, or 6C capsule types. These variantswere produced as described below.

PCR and DNA sequencing: All the PCR primers used in this study arelisted in Table 5. The primers used for multi-locus sequence typing(MLST) were as described by Enright & Spratt, 144(11) Microbiol. 3049-60(1998), and the primers used to amplify the wciN, wciO, and wciP geneswere described by Mavroidi et al., 2004. Additional primers weredesigned using the DNA sequences of the 6A and 6B capsule gene loci inGenBank (accession numbers CR931638 and CR931639, respectively).

TABLE 5 List of PCR primers Primer Primer site of Source or nameNo. CR931638 Description* Sequence (SEQ ID NO) reference Forward primers5101  6949-6966 in wciN, for INDEL detection5′-atttggtgtacttcctcc (NO: 7) (17) 5103  8146-8168in wciO, for sequencing 6C 5′-aaacatgacatcaattaca (NO: 8) This studycapsule gene 5106  5897-5916 in wchA, for wciN detection5′-taccatgcagggtggaatgt (NO: 1) This study 5108  8350-8370in wciP, for wciP allele 5′-atggtgagagatatttgtcac (NO: 3) This studydetection 5112 Not applicable in Kan^(R)-rpsL⁺ with XbaI site5′-ctagtctagagtttgatttttaatgg (NO: 10) This study 5113  4870-4894in wze, for Fragment C 5′-gggaaaaataaaaaataggtcggg (NO: 11) This study5118  7613-7636 in wciO with BamHI site5′-cgcggatccagaaaaactatgtcgcctgctaaa This study (NO: 12) 5120     1-30in dexB, for Fragment A 5′-tgtccaatgaagagcaagacttgacagtag (26) (NO: 13)5122  2187-2206 in wzg, for Fragment B 5′-ttcgtccattcacaccttag (NO: 14)This study 5123  8775-8794 in wciP, for Fragment D5′-tgcctatatctgggggtgta (NO: 15) This study 5124 11274-11293in wzx, for Fragment E 5′-aatgatttgggcggatgttt (NO: 16) This study 512513864-13883 in rmlC, fur Fragment F 5′-agtgattgatgcgagtaagg (NO: 17)This study 5140  9531-9551 in wzy, for wzy allele5′-cctaaagtggagggaatttcg (NO: 18) (17) detection 5141 11459-11478in wzx, for wzx allele 5′-ttcgaatgggaattcaatgg (NO: 19) (17) detectionReverse primers  3101  7888-7905 in wciO, for INDEL and wciN5′-ccatccttcgagtattgc (NO: 2) (17) detections 3103  9468-9487in wzy, for Janus cassette and 5′-aacccctaacaatatcaaat (NO: 20)This study Fragment C 3107  9226-9245 in wciP, for wciP allele5′-agcatgatggtatataagcc (NO: 21) This study dectection 3112Not applicable in Kan^(R)-rpsL⁺ with BamHI site5′-cgcggatccgggcccctttccttatgcttttgg This study  (NO: 22) 3113 6203-6224 in wchA with XbaI site 5′-ctagtctagaaataaaatttcaatatctttccagThis study  (NO: 23) 3121  3676-3660 in wzd, for Fragment A5′-gattgcgattcactacg (NO: 24) This study 3122  5380-5361in wchA, for Fragment B 5′-aactccccaacaacctcatt (NO: 25) This study 312312978-12959 in rmlA, for Fragment D 5′-aaaatcsaggeaacgctatc (NO: 26)This study 3124 14618-14600 in rmlB, for Fragment E5′-acggagagcttgggttgta (NO: 27) This study 3126 17611-17584in aliA, for Fragment F 5′-caataatgtcacgcccgcaagggcaagt  (26) (NO: 28)3143 10135-10115 in wzy, for wzy allele detection5′-cctcccatataacgagtgatg (NO: 29) (17) 3144 12068-12049in wzx for wzx allele detection 5′-gcgagccaaatcggtaagta (NO: 30) (17)*Fragments A through F refers to the fragments of serotype 6C capsulegene locus used for capsule gene locus sequencing.

For capsule gene locus PCR, the reaction mixture had 10 to 30 ng ofchromosomal DNA, I μl of each primer from a 100-pmol stock, 2 μl of 10mM dNTP, 5 μl of 10× buffer solution, 0.5 μl (2.5 U) of Taq polymerase(Takara Biomedical, Shiga, Japan), and 39.5 μl of sterile water (Sigma,Saint Louis, Mich.). The reaction mixture for multi-locus sequencetyping had 10 to 30 ng of chromosomal DNA, 1 μl of each primer from a50-pmol stock, 2 μl of MgCl₂, 5 μl of Q-solution (Qiagen, Chatsworth,Calif.), 12.5 μl of Master Mix (Qiagen), and 4 μl of sterile water(Sigma). Chromosomal DNA was isolated with a Wizard Genomic DNAPurification Kit (Promega, Madison, Wis.) according to themanufacturer's instruction. Thermal cycling conditions were: initialdenaturation at 95° C. for 3 min, 30 cycles of denaturation at 95° C.for 1 min, annealing at 52° C.-58° C. for 1 min, extension at 72° C. for2 min, and a final extension at 72° C. for 10 min. Multi-locus sequencetyping used 30 cycles, and capsule locus gene PCR used 35 cycles. Thesize of the PCR products was determined by electrophoresis in a 1%-1.5%agarose gel.

The DNA sequence of the PCR products was determined by the genomics corefacility at the University of Alabama using an automated DNA sequencer,and the PCR products were purified with a Wizard PCR Cleanup Kit(Promega). DNA sequences were analyzed with Lasergene v. 5.1 software(DNASTAR, Madison, Wis.) and the Basic Local Alignment Search Tool(BLAST) located on-line at the NCBI NLM NIH site.

The sequences from the capsule gene locus were compared with thesequences previously reported. Mavroidi et al., 2004. Alleles of eachsequence type were assigned using the on-line pneumococcal Multi LocusSequence Typing (MLST) website. When the sequences were different, newallele numbers were assigned. All the wciNβ sequences are then depositedin the pneumococcal MLST. The entire capsule gene locus of thepneumococcal isolate CHPA388 is then deposited in GenBank.

Genetic profiles of 6C strains collected from global sources arepresented in Table 6:

TABLE 6 Genetic profiles of 6C strains collected from differentcontinents Multilocus sequence typing (MLST) Capsule gene locus Seq.profile Type No. Strains Country wciP wzy wzx aroE gdh gki recP spi xptddl (ST) 1 CHPA37 USA 9 (1) 10 (0) 1 (0) 1 13 1 43 5 TD* 20 — 2 CHPA388USA 9 (1) 10 (0) 1 (0) 10  13 1 43 98 1 20 1390 3 BGO2197 USA 9 (1) 10(0) 1 (0) 2 13 2 1 6 19 14 1092 4 ACA-C21 Canada 9 (0) 10 (0) 1 (0) 13 1 1 5 6 1 18 1715 5 MX67 Mexico 9 (0) 10 (0) 1 (1) 7 25 4 4 15 20 28 NT6 BZ17 Brazil 9 (1) 10 (0) 1 (0) — — — — — — — — 7 BZ39 Brazil 9 (1) 10(0) 1 (0) — — — — — — — — 8 BZ86 Brazil 9 (1) 10 (0) 1 (0)  7** 13 3 6 11 8 NT 9 BZ650 Brazil 9 (1) 10 (0) 1 (0) — — — — — — — — 10 ST260 Brazil9 (1) 10 (0) 1 (0) 1 5 9 43 5 1 6 NT 11 KK177 Korea 9 (0)  1 (0) 1 (0) 730 8 6 6 6 14 NT 12 CH66 China 9 (0) 10 (0) 1 (1)  7** 42 4 39 25 104 14NT 13 CH158 China 9 (0) 10 (0) 1 (1) — — — — — — — — 14 CH199 China 9(0) 10 (0) 1 (1) — — — — — — — — *TD means technical difficulties.Several attempts to isolate a bacterial clone and sequence xpt of thebacterial clone produced ambiguous sequences. **Numbers indicate thealleles.

Production of TIGR4 variants with 6A and 6C capsule gene loci: Toinvestigate the role of the wciN gene in 6C capsule expression, desiredgenes or gene fragments were inserted into the TIGR4JS4 strain, which isderived from TIGR4 but which has lost the capsule gene locus (FIG. 11).Aliquots of frozen, transformation-competent TIGR4JS4 were made bygrowing it in THY broth at 37° C. until the optical density at 600 nmwas about 0.4-0.5; by diluting it 1:100 in Todd-Hewitt broth (pH 7.2)supplemented with 0.5% yeast extract, 0.2% bovine serum albumin, 0.01%CaCl₂, and 13% glycerol; and by freezing it in 250 μl aliquots at −80°C.

To transform TIGR4JS4, a frozen bacterial aliquot was thawed and mixedwith 50 ng of competence-stimulating peptide variant 2. Trzcinski etal., 2003. After 14 min incubation at 37° C., 100 μl of TIGR4JS4 wasmixed with 10 μl of bacterial lysate (AAU33 strain) or 100 ng of DNAcassettes. After 2 hr incubation at 37° C., the bacteria were plated onsheep blood agar plates containing 200 μg/ml kanamycin or 300 μg/mlstreptomycin and incubated at 37° C. in a candle jar. Colonies oftransformants growing in the antibiotic media were harvested andbackcrossed three times with DNA-recipient competent bacteria.

To prepare a bacterial lysate of AAU33 for transformation, 10 ml of THYbroth was inoculated with the AAU33 strain and cultured for about 5 hrat 37° C. until the optical density at 600 nm was ˜0.4-0.5. The THYbroth was centrifuged to obtain a bacterial pellet, and the pellet waslysed by resuspending it in 0.1 ml of sodium citrate buffer (0.15M, pH7.5) containing 0.1% sodium deoxycholate and 0.01% sodium dodecylsulfateand then incubating it for 10 min at 37° C. The lysate (0.1 ml) was thenmixed with 0.9 ml of normal saline buffered with 0.015M sodium citrate(pH 7.0) and heat-inactivated at 65° C. for 15 min.

To replace the wciNα gene region of TIGR6A4 with the wciNβ gene regionfrom CHPA388, we prepared two different DNA cassettes, which are labeledCassette 1 and Cassette 2 in FIG. 11. Each cassette has three parts: thecentral core containing the target DNA and two flanking DNAs. The twoflanking DNAs are for homologous recombination, are about 1 Kb each, andwere obtained from either wchA or wciO-P genes. The central core ofCassette 1 has kanamycin-resistance (kana^(R)) andstreptomycin-sensitivity (rpsL⁺) genes and is obtained by PCR usingTIGR4JS4 strain DNA as the template. The flanking DNA fragments wereobtained by PCR using chromosomal DNA of AAU33 as the template. All theprimer pairs, which are shown in FIG. 11 and Table 5, have restrictionenzyme sites to facilitate linking the three DNA fragments. The threeDNA fragments were linked together by digestion with an appropriaterestriction enzyme and ligation with T4 DNA ligase (New England BioLabs,Beverly, Mass.). The ligation product was amplified by PCR using primers5113 and 3102. The PCR product was purified by the Wizard PCR CleanupKit (Promega) and subjected to nucleotide sequencing. The PCR productwas then used as donor DNA in the transformation.

Cassette 2 was used to replace the antibiotic selection genes with thewciNβ gene. The central core has the wciNβ gene from CHPA388. The wchAand wciO-P DNA fragments were obtained by PCR from AAU33 as describedfor Cassette 1 (FIG. 11).

Identification of additional 6C strains among “6A” collections: Toobtain a representative collection of 6C serotypes from variouslocations, we re-tested our preexisting collection of “6A” strains byquellung reaction (Mavroidi et al., 2004; Robinson et al., 2002) andidentified nine additional 6C isolates from five countries on threedifferent continents (Table 4). These isolates were obtained from spinalfluid, blood, and the nasopharynx samples, indicating that 6C can beassociated with invasive pneumococcal infections as well as asymptomaticcarriage. One isolate (BG02197) was obtained in 1979 at Birmingham, Ala.This finding shows that the 6C serotype, identified and isolated for thefirst time as described herein, may have been in existence for more thantwenty-seven years and is now found throughout the world.

Many 6C strains have the identical capsule gene locus profile butdifferent sequence types: To begin investigating the genetic basis forthe serotype 6C, the capsule gene locus profiles and the sequence types(STs) of the twelve isolates were examined. Similar to what was observedpreviously for the Brazilian 6C isolates (Lin et al., 2006), all 6Cisolates have allele 9 of the wciP gene with either no or one nucleotidedifference. Similarly, all 6C isolates have allele 1 of the wzx genewith either no or one nucleotide difference. All 6C isolates have allele10 for the wzy gene except for one isolate, which expresses allele 1. Incontrast to the 6C isolates' restricted capsule gene locus profile,multi-locus sequence typing shows that 6C isolates express diverse STs.The fact that 6C is associated with multiple STs but with one singlecapsule gene locus profile (except for one isolate) suggests that thegene(s) responsible for the 6C serotype is probably in the capsule genelocus.

The capsule gene loci of 6A and 6C differ in the region between the wchAand wciO genes: It was hypothesized that the genetic difference betweenserotypes 6A and 6C is a glycosyl transferase gene, the same gene thatis responsible for the difference between serotypes 6A and 6B. When PCRwas used to compare the sizes of their glycosyl transferase genes, itwas found that the sizes of their wciN genes were different. The wciNPCR products of all 6C isolates were about 1.8 kb long whereas the wciNPCR products of all 6A isolates were about 2 kb long (FIG. 12). Todistinguish between the two wciN genes from the 6A and 6C serotypes,they have been named wciNα and wciNβ, respectively.

To further investigate wciNβ gene, the DNA sequences of the wciNβ generegion including the wchA and wciO genes from five 6C strains. (BZ17,BZ86, CHPA388, KK177, and ST 260) were compared. Because their sequenceswere almost identical, the actual DNA sequence is shown for only CHPA388(FIG. 13) and the sequences of other isolates are deposited in GenBank.The sequence of the wciNβ gene from CHPA388 was then compared with the6A sequence of the corresponding region available at the GenBank (no.CR931638) (FIG. 13). A summary of the comparison is shown in FIG. 14.

The sequence comparison revealed clear differences in wciNα and wciNβgenes: The 6C serotype has 1029-bp-long DNA in place of 1222-bp-long DNAin 6A (FIG. 14 and FIG. 15). The two wciN genes are completelydifferent, with the sequence homology being only about fifty percent.The DNA difference begins immediately after the termination of wchA gene(position 1368) and ends 130 bases upstream to the beginning of the wciOgene (positions 239.8 for 6C and 2631 for 6A) (FIG. 14 and FIG. 15).When the DNA sequences flanking the replaced gene were compared between6A and 6C, significantly more DNA polymorphisms were found in theflanking regions than in the regions outside of the two flankingregions. For instance, the 300 bases upstream from the replaced genehave 25 different nucleotides, but the 150 bases located immediatelyupstream from the 300 bases have only one different base (p<0.001 byFisher's exact test) (FIG. 15). Similarly, in the 3′ direction, 20 basesdiffer in the proximal 110 bases but only 1 base differs in the next 300bases. (p<0.001 by Fisher's exact test) (FIG. 15). These findings arenot unique to this particular 6A sequence (CR931638) because similarresults were obtained with the sequence of 7 different 6A strains AAU33,D020-1B, HS3050, CHPA78, KK65, ST19, and ST558. These findings suggestthat the two flanking regions were parts of the new gene that has beeninserted into 6A to create 6C.

The flanking regions may have been involved in the homologousrecombination of the wciNβ gene to the 6A capsule locus. Furthermore,all 6C isolates have the identical flanking region sequences. Thissuggests that the genetic replacement took place only once and that allthe 6C isolates must be progeny of this single founder bacterium.

With this gene replacement, wciNβ has a new open reading frame (ORF)that is 1125 bases long and encodes a peptide with 374 amino acids,which is named the WCINβ protein (FIG. 13). The termination codon of thenew ORF is between the two potential start codons for the wciO gene,which are located at positions 2497 and 2528 of 6C. When the sequence ofthe wciNβ gene was compared with the sequences in the database, 110bases (from 1627 to 1736 in 6C) of 6C demonstrated 81% homology to the90 bases of the exopolysaccharide synthesis gene of Streptococcusthermophilus strain CNRZ1066 (Bolotin et al., 22 Nat. Biochem. 1554-58(2004) (FIG. 13). Also, the translated sequence of wciNβ gene has 22%amino acid identity and 44% similarity to the translated sequence ofcapH gene of Staphylococcus aureus. Lin et al., 176 J. Bacteriol. 700516 (1994). The wciNβ gene product is a member of the waaG family.Incidentally, the waaG gene product of E. coli K-12 is anα1,3-glucosyltransferase involved in LPS synthesis. Heinrichs et al., 30Mol. Microbiol. 221-32 (1998).

The sequences of the capsule gene loci of the 6A and 6C serotypes differonly slightly in regions other than the wciN gene: To determine if the6A and 6C capsule gene loci differ only in the wciN region, the sequenceof the entire capsule locus of a 6C isolate (CHPA388) was analyzed byPCR amplifying the entire capsule gene locus between dexB and aliA lociin six overlapping DNA fragments. FIG. 16 shows the genetic map of thesequence of the capsule gene locus. The entire CHPA388 locus ispresented in FIG. 17. The 6C capsule gene locus contained fourteen ORFsinvolved in the capsular PS synthesis. The ORFs are in the sametranscription orientation and correspond exactly to those found for the6A capsule gene locus. The ORFs of 6C begin with cpsA gene at the 5′ endand end with rmlD gene at the 3′ end. As shown in FIG. 16, a putativepromoter binding region and a transcription start site for 6C capsulegene locus are found 5′ to the cpsA gene and a putative transcriptionterminator site is found 3′ to the rmlD gene. Additionally, there areinsertion element (or “trip” or “transposase”) sequences at both ends ofthe capsule gene locus, as are commonly found for many pneumococcalcapsule gene loci. Bentley et al., PLoS Genet. 2:e31 (2006). Thenucleotide sequence of the entire locus are deposited in GenBank.

When the sequence was compared with the capsule gene locus of a 6Astrain (GenBank accession No. CR31638), except for the wciN regiondescribed above, the capsule gene locus of 6C was very homologous (˜98%)to that of 6A. Also, homology was significantly low (about 78%) forabout 60 by in the middle of the cpsA ORF and the “tnp's” found ateither end of the capsule gene loci were different between the 6A and 6Ccapsule gene loci. The 6C capsule gene locus did not have the INDEL thatis present upstream to the wciO gene in some 6A or 6B capsule gene loci.Mavroidi et al., 2004. Despite these differences, the most prominentdifference between 6A and 6C capsule gene loci is found in the wciNregion.

The wciN gene region is responsible for conversion from the 6A to 6Cserotype. Although the above comparison of the capsule gene loci showedthat the major difference is in the wciN region, minor differences arepresent in the entire capsule gene region (e.g., cpsA ORF). It ispossible that some other small genetic differences outside of thecapsule locus could be involved in the 6C expression. To show that onlythe wciN region is involved, whether the interchange of the wciNα regionwith the wciNβ region could convert the 6A serotype to the 6C serotype(FIG. 11) was examined. TIGR6A was produced by replacing the capsulelocus of TIGR4 with the 6A capsule gene locus from strain AAU33. ThewciNα gene was then removed from TIGR6A by transforming it withCassette 1. The resulting strain, named TIGR6AX, was non-capsulated andwas found, via PCR, to have lost the wciNα gene between positions 1325and 2518. The wciNβ region was then inserted into TIGR6AX using Cassette2, which contained the wciNβ gene from CHPA388. PCR confirmed that theresulting strain, TIGR6c, had wciNβ at the expected location. TIGR6c wasfound to express serotype 6C and this confirmed that the wciNβ generegion is sufficient for the serotype conversion.

The invention claimed is:
 1. A composition comprising a purifiedpolysaccharide comprising the repeating unit {→2) glucose 1 (1→3)glucose 2 (1→3) rhamnose (1→3) ribitol (5→phosphate}.
 2. The compositionof claim 1, wherein said polysaccharide is conjugated to a carrier. 3.The composition of claim 2, wherein the carrier is a protein.
 4. Thecomposition of claim 2, wherein the carrier is a bead.
 5. Thecomposition of claim 1, wherein the polysaccharide is produced by abacterium expressing the capsule gene locus having the nucleotidesequence of FIG. 17 (SEQ ID NO:43).
 6. The composition of claim 1,wherein the polysaccharide is produced by an isolated Streptococcuspneumoniae 6C, wherein said Streptococcus pneumoniae 6C has a capsularpolysaccharide having the repeating unit {→2) glucose 1 (1→3) glucose 2(1→3) rhamnose (1→3) ribitol (5→phosphate}.